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GRAPT^VTNF T.E&yROT .T. VTRUS PROTEINS AND THEIR USES 

This work was supported by United States -Israel 
Binational Agricultural Research and Development Fund Grant 
No. US -1737- 89 and by the United States Department of 
Agriculture Cooperative Agreement No. 58-2349-9-01. The 
Federal Government may have certain rights in the invention'. 

FIELD OF THE INVENTION 

The present invention relates to grapevine leafroll virus 
proteins, DNA molecules encoding these proteins, and their 
uses. 

!5 BACKGROUND OF THE INVENTION 

The world's most widely grown fruit crop, the grape 
[Vitis sp.) , is cultivated on all continents except 
Antarctica. Major grape production centers are in European 

20 countries (including Italy, Spain, and France) , which 

constitute about 70% of the world grape production (Mullins et 
al., PHnlocrv of the Grapevine . Cambridge, U.K., University 
Press (1992)). The United States is the eighth largest grape 
grower in the world. Although grapes have many uses, a major 

25 portion of grape production (-80%) is used for wine 

production. Unlike cereal crops, most of the world's 
vineyards are planted with traditional grapevine cultivars, 
which have been perpetuated for centuries by vegetative 
propagation. Several important grapevine virus and virus -like 

3 0 diseases, such as grapevine leafroll, corky bark, and 

Rupestris stem pitting, are transmitted and spread through the 
use of infected vegetatively propagated materials. Thus, 
propagation of certified, virus -free materials is one of the 
most important disease control measures. Traditional breeding 

3 5 for disease resistance is difficult due to the highly 

heterozygous nature and outcrossing behavior of grapevines, 
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and due to polygenic patterns of inheritance. Moreover, 
introduction of a new cultivar may be prohibited by custom or 
law. Recent biotechnology developments have made possible the 
introduction of special traits, such as disease resistance, 
into an established cultivar without altering its 
horticultural characteristics. 

Many plant pathogens, such as fungi, bacteria, 
phytoplasmas, viruses, and nematodes can infect grapes, and 
the resultant diseases can cause substantial losses in 
production (Pearson et al., Compendium of Grape r)iRP^ f?f >< ? | 
American Phytopathological Society Press (1S88)). Among 
these, viral diseases constitute a major hindrance to 
profitable growing of grapevines. About 34 viruses hava been 
isolated and characterized from grapevines. The major virus 
diseases are grouped into: (1) the grapevine degeneration 
caused by the fanleaf nepovirus, other European nepoviruses, 
and American nepoviruses, (2) the leafroll complex, and (3) 
the rugose wood complex (Martelli, ed., Graft Transmi^^^io 
Diseases of Grapevines, Handbook for Detection an d Pi agnncjj ^ j 
FAO, UN, Rome, Italy (1993)). 

Grapevine leafroll complex is the most widely distributed 
of the major diseases of grapes. According to Goheen (Goheen, 
"Grape Leafroll," in Frazier et al., eds., Virus nis^ac^ 0 f 
Small Frrnfs and Grappvi nes (A Handbook), University of 
California, Division of Agricultural Sciences, Berkeley, 
Calif, USA, pp. 209-212 (1970), grapevine leaf roll-like 
disease was described as early as the 1850s in German and 
French literature. The viral nature of the disease and graft 
transmission were first demonstrated by Scheu (Scheu, D . n . 
Weinbau 14:222-358 (1935) . In 1946, Harmon and Snyder (Harmon 
et al., Proc. Am. Soc . Hor t . Sci . 74:190-194 (1946)) 
determined the virus nature of White Emperor disease in 
California. It was later proven by Goheen et al . (Goheen et 
al., Phvropatholocrv , 48:51-54 (1958)) that both leafroll and 
"White Emperor" diseases were -the same, and only the name 
"leafroll" was retained. 
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Leafroll is a serious virus disease of grapes and occurs 
wherever grapes are grown. This wide distribution of the 
disease has come abouz through the propagation of diseased 
vines. It affects almost all cultivated and rootstock 
5 varieties of Vitis. Although the disease is not lethal, it 
causes yield losses and reduced sugar content, Scheu 
estimated in 1936 that 6 0 per cent of all grapevines planted 
in Germany were infected (Scheu, Ntein wingerbuch. Berlin, 
Reichsnahrstand-Verlags (1936)). In many California wine 

10 grape vineyards, the incidence of leafroll (based on a 1959 
survey of field symptoms) agrees with Scheu f s initial 
observation in German vineyards (Goheen et al. f Amg-r. j, Fpol , 
vitic , . 10:78-84 (1959)). The current situation on leafroll 
disease appears similar (Goheen, THq American 

15 Phyt-opathnloaical Society . St. Paul, Minnesota : APS Press, 

1:47-54 (1988). Goheen estimated that the disease causes an 
annual loss of about 5-20 per cent of the total grape 
production (Goheen (1970); Goheen (1988)). The amount of 
sugar in individual berries of infected vines is only about 

2 0 1/2 to 2/3 that of berries from noninfected vines (Goheen 

(1958) ) . 

Symptoms of leafroll disease vary considerably depending 
upon the cultivar, environment, and time of the year. On red 
or dark-colored fruit varieties, the typical downward rolling 
25 and interveinal reddening of basal, mature leaves is prevalent 
in autumn and is less apparent in spring or early summer. On 
light -colored fruit varieties, symptoms are less conspicuous, 
usually downward rolling accompanied by interveinal chlorosis . 
Moreover, many infected rootstock cultivars do not develop 

3 0 symptoms. In these cases, the disease is usually diagnosed 

with a woody indicator indexing assay using Vitis vivifera cv. 
Carbemet Franc (Goheen (1988) ) . 

Ever since Scheu demonstrated that leafroll was graft 
transmissible, a virus etiology has been suspected (Scheu 
35 (1935)). Several virus particle types have been isolated from 
leafrcll diseased vines. These include potyvirus-like (Tanne 
et al. f Phytopathology , €7:442-447 (1977)), isometric virus- 
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like (Castellano et al., Eis, 22:23-39 (1983)) and Namba et 
al., Ann. Phvtnpathol .^nr. Japan. 45:70-73 (1979)), and 
closterovirus-like (Namba, An^ . Phvtonat-hn l . ,g^ r ,7^^ 
45:497-502 (1979)) particles. In recent years, however, long 
flexuous closteroviruses ranging from 1,400 to 2,200 nm in 
length have been most consistently associated with leafroll 
disease (Castellano (1983), Faoro et al . , Riv. Patnl . yog 
Ser IV/ 17:183-189 (1981); Gugerli et al . 7 Rev. Sin^o 
Vitinulf, Arboric ult. Kort . . 16:299-304 (1984); Hu et al . , 
PhytOpathQl . , 128:1-14 (1990); Milne et al . , Phvtopafhol . % 
110:360-368 (1984) ; Zee et al., Phvtopathnlnpy, 77:1427-1434 
(1987); Zimmermann et al., J. Phvtonathnl r 130:205-218 
(1990) . These closteroviruses are referred to as grapevine 
leafroll associated viruses (GLRaV) . At least six 
serologically distinct types of GLRaV 1 s ( GLRaV- 1 to -6) have 
been detected from leafroll diseased vines (Table 1) (Boscia 
et al., Vitiff, 34:171-175 (1995); (Martelli, "Leafroll," pp. 
37-44 in Martelli, ed., Graft Transmiss i ble DjRPRses nf 
Grapevines. Handbook for Detection and Diacmn.qifi r FAO, Rome 
Italy, (1993)). The first five of these were confirmed in the 
10th Meeting of the International Council for the Study of 
Virus and Virus Diseases of the Grapevine (ICVG) (Volos, 
Greece, 1990) . Through the use of monoclonal antibodies, 
however, the original GLRaV II described in Gugerli (1984) has 
been shown to be an apparent mixture of at least two 
components, Ila and lib (Gugerli et al., "Grapevine Leafroll 
Associated Virus II Analyzed by Monoclonal Antibodies," llth 
Meeting of the International Council for the sniri y of vimsps 
and Virus Diseases of the Grapevine/ Montreux, Switzerland, 
pp. 23-24 (1993)). Recent investigation with comparative 
serological assays (Boscia (1995)) demonstrated that the lib 
component of cv. Chasselas 8/22 is the same as the GLRaV- 2 
isolate from France (Zimmermann (1990)) which also include the 
isolates of grapevine corky bark associated closteroviruses 
from Italy (GCBaV-BA) {Boscia (1995)) and from the United 
States (GCBaV-NY) (Namba et al., Phytopathology - 81:964-970 
(1991)). The Ila component of cv. Chasselas 6/22 was given 
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the provisional name of grapevine leafroll associated virus 6 
(GLRaV-6) . Furthermore, the antiserum to the CA-5 isolate of 
GLRaV-2 produced by Boscia et al. (Boscia et al., 
Phytopathology. 80:117 (1990)) was shown to contain antibodies 
to both GLRaV-2 and GLRaV-1, with a prevalence of the latter 
(Boscia (1995)). 

Several shorter closteroviruses (particle length BOO nm 
long) have also been isolated from grapevines. One of these, 
called grapevine virus A (GVA) has also been found associated, 
though inconsistently, with the leafroll disease (Agran et 
al., Vitis, 29:43-48 (1990); Conti, et al., Phvtopai-Vi 0 1 , 
Mediterr . . 24:110-113 (1985); Conti et al., Phvt opal- hoi ngv j 
70:394-399 (1980)). The etiology of GVA is not really known; 
however, it appears to be more consistently associated with 
rugose wood sensu lato (Rosciglione at al., Rev. Sinssf* Viri^ 
ArbQric, Hortic, 18:207-211 (1986) ; and Zimmermann (1990)). 
Another short closterovirus (800 nm long) named grapevine 
virus B (GVB) has been isolated and characterized from corky 
bark-affected vines (Boscia et al., Arch . Virol . r 130:109-120 
(1993) ; Namba (1991) ) . 

As suggested by Martelli, leafroll symptoms may be 
induced by more than one virus or they may be simply a general 
plant physiological response to invasion by an array of 
phloem-inhabiting viruses. Grapevine leafroll is induced by 
one (or a complex) of long closteroviruses (particle length 
1, 400 to 2,200 nm) . 

Grapevine leafroll is transmitted primarily by 
contaminated scions and rootstocks. However, under field 
conditions, several species of mealybugs have been shown 
vectors of leafroll (Engelbrecht et al., PhvtophylacrtH ra r 
22:341-346 (1990); Engelbrecht et al., Phytophylart i <-p , 
22:347-354 (1990); Rosciglione, et al., (Abstract), 
Phyt Parasitica, 17:63-63 (1989); and Tanne, Phvt.onarasit i ca , 
16:288 (1988)). Natural spread of leafroll by insect vectors 
is rapid in various parts of the world. In New Zealand, 
observations of three vineyards showed that the number of 
infected vines nearly doubled in a single year (Jordan et al . , 
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11th Mating cf the Inr^m^ioTial Council for t- he firurty ff f 
Viruses and Virus Disea ^g of the Grapevine r Montreux, 
Switzerland, pp. 113-114 (1993)). One vineyard became 90V 
infected 5 years after GLRaV-3 was first observed. Prevalence 
of leafroll worldwide may increase as chemical control of 
mealybugs becomes more difficult due to the unavailability of 
effective insecticides. 

In view of the serious risk grapevine leafroll virus 
poses to vineyards and the absence of an effective treatment, 
there is a need to prevent this disease and the resulting 
economic losses. The present invention overcomes this 
deficiency in the art . 

SUMMARY OF INVENTION 

The present invention relates to an isolated protein or 
polypeptide corresponding to a protein or polypeptide of ? 
grapevine leafroll virus. The encoding RNA and DNA molecules, 
in either isolated form or incorporated in an expression 
system, vectors, host cells, and transgenic Vitis or citrus 
scions or rootstock cultivars, are also disclosed. 

Another aspect of the present invention relates to a 
method of imparting grapevine leafroll virus resistance to 
Vitis scion or rootstock cultivars by transforming them with a 
DNA molecule encoding a protein or polypeptide of a grapevine 
leafroll virus. These DNA molecules can also be used in 
transformation of citrus scion or rootstock cultivar to impart 
tristeza virus resistance to such cultivars. 

The present invention also relates to an antibody or 
binding portion thereof or probe which recognizes the protein 
or polypeptide or the nucleic acid encoding same. 

Grapevine leafroll virus resistant transgenic variants of 
the current commercial grape cultivars and rootstocks allows 
for improved control of the virus while retaining the varietal 
characteristics of specific cultivars. Furthermore, these 
variants permit control of GLRaV transmitted by contaminated 
scions or rootstocks or by GLRaV- carrying mealybugs or other 
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insect pests. With respect to the latter mode of 
transmission, the present invention circumvents increasingly 
restricted pesticide use, which has made chemical control of 
mealybug infestations increasingly difficult. Thus, the 
interests of the environment and the economics of grape 
cultivation and wine making are benefited by the present 
invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1A-1B, illustrates the results of Northern blot 
hybridization. Figure IB shows that probe made from a clone 
insert gave positive reaction with itself (lane 3) and to 
dsRNA from leafroll infected tissues (lane 1) , but not with 
nucleic acids extracted from healthy grapevines (lane 2) 
Lane M contains molecular weight markers (Hindlll digested 
lambda DNA) . Figure 1A depicts an ethidium bromide stained 
agarose gel before transfer to a membrane . 

Figure 2 presents an analysis of GLRaV-3 dsRNA by 
electrophoresis on an ethidium bromide stained agarose gel. A 
dsRNA of ca. 16 kb was readily isolated from diseased 
grapevine (lane 6), but not from the healthy control (lane 5) 
Other samples that were used for control were tobacco mosaic 
virus dsRNA (lane 1); cucumber mosaic virus dsRNA (lane 2) ; 
pBluescript vector (lane 3) and an insert of clone pC4 . A 
Hindi 1 1 digested lambda DNA was used as molecular weight 
markers (lane M) . 

Figure 3 is a Western blot of antibodies to GLRaV-3 that 
reacted to proteins produced by cDNA clones after IPTG 
induction in E. coli. Similar banding patterns were observed 
whether a polyclonal (panel A) or a monoclonal antibody 
(panel B) was used. Lane 1 shows clone pCP10-l; lane 2, pC?5 ; 
lane 3, pCP8-4; and lane 4, the native coat protein from 
GLRaV-3 infected tissue. Lane M is a prestained protein 
molecular weight marker. 

Figure 6 shows the cDNA clones containing the coding 
region for the coat protein of the NY1 isolate of GLRaV-3 . 
Three clones (pCP8-4, pCP5, pC?lC-l) were identified by 
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inmunoscreening a cDNA library prepared in lambda ZAP II. Two 
ocher clones were aligned after plaque hybridization and 
nucleotide sequencing. The coat protein ORF is shown by an 
arrow in an open rectangle. 

Figure 5 is the phylogenetic tree generated using results 
obtained using the Clustal Method of MegAlign program in 
DNASTAR for the coat protein of GLRaV-3 . The coat protein of 
GLRaV-3 was incorporated into a previously described alignment 
(Dolja et al., Ann. Rev. Phytopathol . f 32:251-285 (1994)) for 
comparison. The other virus sequences were obtained from 
current databases; apple chlorotic leaf spot virus (ACLSV) ; 
apple stem grooving virus (ASGV) ; apple stem pitting virus 
(ASPV) ; barley yellow mosaic virus (BaMV) ; beet yellows 
closterovirus (BYV) ; diverged copies of BYV and CTV coat 
proteins (BYV p24 and CTV p27, respectively); citrus tristeza 
virus (CTV) ; grapevine virus A (GVA) ; grapevine virus B (GVB) ; 
lily symptomless virus (LSV) ; lily virus X (LVX) ; narcissus 
mosaic virus (NMV) ; pepper mottle virus (PeMV) ; papaya mosaic 
virus (PMV) ; potato virus T (PVT) ; potato virus S (PVS) ; 
potato virus M (PVM) ; potato virus X (PVX) ; tobacco etch 
virus (TEV) ; tobacco vein mottle virus (TVMV) ; and white 
clover mosaic virus (WcMV) . 

Figure 6 depicts an analysis of reverse transcription 
polymerase chain reaction (RT-PCR) to detect GLRaV-3 in a 
partially purified virus preparation. The original sarnnle 
concentration is equivalent to 50 mg//xl of phloem tissue (lane 
1) which was diluted by 10-fold series as 10" 1 (lane 2) , 10" 2 
(lane 3), 10' 3 (lane 4), 10"* (lane 5), and 10" s (lane 6), 
respectively. The expected 219 bp PCR product was clearly 
observed up to lane 4, which is equivalent to a detection 
limit of 10 fig of phloem tissue. Lane 7 was a healthy 
control. Lane 8 was dsRNA for positive control. Lanes 9-11 
were also used for positive controls of purified viral RNA 
(lane 9) , dsRNA (lane 10) , and plasmid DNA (pC4) (lane 11) as 
templates, respectively. Lane -M contains molecular weight 
markers (tfaelll digested fx 174 DNA) . 
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Figures 7A-7B depicts comparative analysis of Nested PCR 
with immuno- capture preparations on field collected samples. 
Using a polyclonal antibcdy to GLRaV-3 for immune - capture , the 
expected 64 8 bp PGR product was not consistently observed in 
the first round of PCR amplification with external primers 
over a range of samples (lanes 1-7, Figure 7A) . However, the 
expected 219 bp PCR product amplified by internal primers was 
consistently observed over all seven samples (lanes 1-7, 
Figure 7B) . A similar inconsistency is also shown in a sample 
prepared by proteinase K- treated crude extract (compare panels 
A to B on lane 8) . With dsRNA as template, the expected PCR 
products were readily observed in both reactions (compare lane 
10 in Figure 7A and 7B) . No such products were observed on a 
healthy sample (lane 9) . Lane M contained molecular weight 
markers (Haelll digested fx 174 DNA) . 

Figures 8A-8B depict comparative studies on the 
sensitivity of Nested PCR with samples prepared using 
proteinase K-treated crude extract (Figure 8A, PK Nested PCR) 
and by immuno-capture preparation (Figure 8B, IC Nested PCR) . 
Nested PCR was performed on samples with serial 10 -fold 
dilutions of up to 10" fi in a proteinase K-treated and 10** in an 
immuno-capture preparation. The expected 219 bp PCR product 
was observed up to 10" 5 in PK Nested PCR and over 10" a (the 
highest dilution used in this test) in IC Nested PCR. A 
similar PCR product was also observed with dsRNA template but 
not healthy grape tissues (H. CK) . Lane M contained molecular 
weight markers (Haelll digested fX 174 DNA) . 

Figure 9 shows partial genome organization for GLRaV-3 
and the cDNA clones used to determine nucleotide sequence. 
Numbered lines represent nucleotide coordinates in kilobases 
(kb) . 

Figure 10 depicts the proposed genome organization of the 
GLRaV-3 in comparison with three other closterovirus genomes, 
BYV, CTV, and LIYV (Dolja (1994)). Homologous proteins are 
shown by identical patterns. Papain-like proteinase (P-PRO) ; 
methyl transferase of type 1 (MTR1) ; RNA helicase of 
superfamily 1 (HELD ; RNA polymerase of supergroup 3 (PL03); 
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HSP70 -related protein (HSP70r) ; and capsid protein forming 
filamentous virus particle (CPf ) , 

Figure 11 is the phylogenetic tree showing the amino acid 
sequence relationship of the helicase of alphaviruses . The 
5 helicase domain of GLRaV-2 (291 aa) from the present study is 
used. The other virus sequences were obtained from current 
databases (Swiss-Prot and GenBank, release 84.0). Apple 
chlorotic leaf spot virus (ACLSV) ; broad bean mottle virus 
(BbMV) ; brome mosaic virus (BMV) ; beet yellow closterovirus 

10 (BYV) ; cowpea chlorotic mottle virus (CcMV) ; cucumber mosaic 
virus (CMV) ; fox mosaic virus (FxMV) ; lily symptomless virus 
(LSV) ; lily virus X (LXV) ; narcissus mosaic virus (NMV) ; pea 
early browning virus (PeBV) ; papaya mosaic virus (PMV) ; poplar 
mosaic virus (PopMV) ; peanut stunt virus (PSV) ; potato virus S 

15 (PVS) ; potato virus M (PVM); potato virus X (PVX) ; strawberry 
mild yellow edge-associated virus (Sm Yea V) ; tomato aspermy 
virus (TAV) ; tobacco mosaic virus (TMV) ; tobacco rattle virus 
(TRV) ; and white clover mosaic virus (WcMV) . 

Figure 12 shows the phylogenetic tree for the RNA 

20 dependent RNA polymerases (RdRp) of the alpha -like supergroup 
of positive strand RNA viruses. The deduced amino acid 
sequence of the RdRp of GLRaV-3 was incorporated into a 
previously described alignment (Dolja (1994)) for comparison. 
The other virus sequences were obtained from current 

25 databases : Apple chlorotic leaf spot virus (ACLSV) ; alfalfa 
mosaic virus (A1MV) ; apple stem grooving virus (ASGV) ; brome 
mosaic virus (BMV) ; beet necrotic yellow vein virus (BNYW) ; 
beet yellow virus (BYV) ; barley stripe mosaic virus (BSMV) ; 
beet yellow stunt virus (BYSV) ; cucumber mosaic virus (CMV) ; 

30 citrus tristeza virus (CTV) ; hepatitis E virus (HEV) ; potato 
virus M (PVM) ; potato virus X (PVX) ; raspberry bushy dwarf 
virus (RBDV) ; shallot virus X (SHVX) ; Sinbis virus (SNBV) ; 
tobacco mosaic virus (TMV) ; tobacco rattle virus (TRV) ; and 
turnip yellow mosaic virus (TYMV) . 

35 Figure 13 is the predicted phylogenetic relationship for 

viral and cellular HSP70 proteins. HSP70-related protein of 
GLRaV-3 (p59) was incorporated into a previously described 



WO 97/22700 



11 



PCT/US96/20747 



alignment (Dolja (1994)) for comparison. The sequences of 
BYV, CTV, and LIYV proteins were from Agranovsky et al . , J . 
Qgn. Virol. . 217:603-610 (1991), Pappu et al., Virol ocry, 
199:35-46 (1994), and Klaassen et al., Virology, 208:99-110 
(1995) , respectively. Only the N-terminal half of beet yellow 
stunt virus HSP70-related protein (Karasev et al., J . G&n , 
Virol . . 75:1415-1422 (1994)) is used. Other sequences were 
obtained from the Swiss -Prot database; their accession numbers 
are as follows: DNA1_BACSU, Bacillus subtilis (P13343); 
DNAKJECOLI, Escherichia coli (P04475) ; HS70_CHICK (P08106) ; 
HS70_ONCMY, Oncorhynchus mykiss (P08108) ; HS70_PLACB, 
Plasmodium cynomolgi (Q05746) ; HS70_SCHMA, Schistosoma 
mansoni (P08418) ; HS70_XENLA, Xenopus laevis (P02827) ; 
HS71_DR0ME, Drosophila melanogaster (P02825)'; HS71_HUMAN 
(P08107) ; HS71_MOUSE (P17879); HS71_PIG (P34930) ; HS74_PARLI , 
Paracentrotus lividus (Q06248) / HS74JTRYBB, Trypanosoma brucei 
(P11145) ; and ZMHSP702 , maize gene for heat shock protein 70 
exon 2 (X03697) . 

Figure 14 summarizes the strategies employed in the 
construction of the plant transformation vector p B in 1 9 GLRa V - 
3hsp90-12-3. A plant expression cassette, in the Hindlll- 
EcoRI fragment containing CaMV 35S-35S promoters-AMV 5 ! 
untranslated sequence-43K ORF-Nos 3' untranslated region, was 
excised from pBI525GLRaV-3hsp90 and cloned into the similarly 
(restriction enzyme) treated plant transformation vector 
pBinl9. The resulting clone, pBinl9GLRaV-3hsp90-12-3 , is 
shown* Locations of important genetic elements within the 
binary plasmid are indicated: BR, right border; BL, left 
border; Nos-NPT II, plant expressible neomycin 
phosphotransferase gene; Lac - LAC Z, plant expressible Lac Z 
gene; and Bacterial Kan, bacterial kanamycin resistance gene. 

Figure 15 shows the Agrobacterium-binary vector 
pGA4 82G/cpGLRaV-3 , which was constructed by cloning the 
Hindi 1 1 fragment of pEPT8cpGLRaV-3 into a derivative of pGA4 82 
and used for transformation via Agrobacterium or Biolistic 
approach. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to isolated DNA molecules 
encoding the proteins or polypeptides of a grapevine leafroll 
virus. A substantial portion of the grapevine leafroll virus 
genome, within which are a plurality of open reading frames, 
has been sequenced by the present inventors. One such DNA 
molecule contains an open reading frame encoding grapevine 
leafroll virus helicase and comprising the nucleotide sequence 
corresponding to SEQ ID NO:l. The helicase has an amino acid 
sequence corresponding to SEQ ID NO: 2 and a molecular weight 
from about 146 to about 151 kDa, preferably about 148,5 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus RNA-dependent RNA 
polymerase and comprises the nucleotide sequence corresponding 
to SEQ ID NO: 3. The RNA-dependent RNA polymerase has an amino 
acid sequence as given in SEQ ID NO: 4 and a molecular weight 
from about 59 to about 63 kDa, preferably about 61 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus hsp70 -related 
protein or polypeptide and comprises the nucleotide sequence 
corresponding to SEQ ID NO: 5. The hsp70 -related protein has 
an amino acid sequence corresponding to SEQ ID NO: 6 and a 
molecular weight from about 57 to about 61 kDa, preferably 
about 5 9 kDa. 

Another such DNA molecule, comprises an open reading frame 
which codes for a grapevine leafroll virus hsp90 -related 
protein and comprises the nucleotide sequence corresponding to 
SEQ ID NO: 7. The hsp90-related protein has an amino acid 
sequence corresponding to SEQ ID NO: 8 and a molecular weight 
from about 53 to about 57 kDa, preferably about 55 kDa. 

Another such DNA molecule comprises an open reading frame 
which codes for a grapevine leafroll virus coat protein or 
polypeptide. The DNA molecule comprises the nucleotide 
sequence corresponding to SEQ ID NO: 9. The coat protein has 
an amino acid sequence as given in SEQ ID NO: 10 and a 
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molecular weight from about 33 to about 43 kDa, preferably 
about 3 5 kDa. 

Alternatively, the DNA molecule of the present invention 
can constitute an open reading frame which codes for a first 
undefined protein or polypeptide. This DNA molecule comprises 
the nucleotide sequence corresponding to SEQ ID NO: 11. The • 
first undefined protein or polypeptide has an amino acid 
sequence corresponding to that in SEQ ID NO: 12 and a molecular 
weight from about 5 to about 7 kDa, preferably about 6 kDa. 

Another such DNA molecule constitutes an open reading 
frame which codes for a second undefined grapevine leaf roll 
virus protein or polypeptide and comprises the nucleotide 
sequence corresponding to SEQ ID NO:13. The second undefined 
protein or polypeptide has an amino acid sequence as given in 
SEQ ID NO: 14 and a molecular weight from about 4 to about 6 
kDa, preferably about 5 kDa. 

Another such DNA molecule constitutes an open reading 
frame which codes for a grapevine leafroll virus coat protein 
repeat and comprises the nucleotide sequence corresponding to 
SEQ ID NO: 15. The coat protein repeat has an amino acid 
sequence as given in SEQ ID NO: 16 and a molecular weight from 
about 51 to about 55 kDa, preferably about 53 kDa. 

Yet another such DNA molecule constitutes an open reading 
frame which codes for a third undefined grapevine leafroll 
virus protein or polypeptide and comprises the nucleotide 
sequence corresponding to SEQ ID NO: 17. The third undefined 
protein or polypeptide has an amino acid sequence as given in 
SEQ ID NO: 18 and a molecular weight from about 33 to about 3 9 
kDa, preferably about 3 6 kDa. 

Yet another DNA molecule which constitutes an open 
reading frame for a fourth undefined grapevine leafroll virus 
protein or polypeptide comprises the nucleotide seuqence 
corresponding to SEQ ID NO: 19. The fourth undefined protein 
or polypeptide has an amino acid sequence as given in SEQ ID 
NO: 20 and a molecular weight from about 17 to about 23 kDa, 
preferably about 20 kDa. 
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Yet another DNA molecule constitutes an open reading 
frame for a fifth undefined grapevine leafroll virus protein 
or polypeptide and comprises the nucleotide sequence 
corresponding to SEQ ID NO: 21. The fifth undefine protein or 
5 polypeptide has an amino acid sequence as given in SEQ ID 

NO: 22 and a molecular weight from about 17 to about 23 kDa, 
preferably about 20 kDa. 

Yet another DNA molecule of the present invention 
consitutes an open reading frame for a sixth undefined protein 
10 or polypeptide and comprises the nucleotide sequence 

corresponding to SEQ ID NO: 23. The sixth undefined protein or 
polypeptide has an amino acid sequence as given in SEQ ID 
NO: 24 and a molecular weight from about 5 to about 9 kDa, 
preferably about 7 kDa. 
15 Also encompassed by the present invention are fragments 

of the DNA molecules of the present invention. Suitable 
fragments capable of imparting grapevine leafroll resistance 
to grape plants are constructed by using appropriate 
restriction sites, revealed by inspection of the DNA 
20 molecule's sequence, to: (i) insert an interposon (Felley et 
al., Gene . 52:147-15 (1987), which is hereby incorporated by 
reference) such that truncated forms of the grapevine leafroll 
virus coat polypeptide or protein, that lack various amounts 
of the C-terminus, can be produced or (ii) delete various 
25 internal portions of the protein. Alternatively, the sequence 
can be used to amplify any portion of the coding region, such 
that it can be cloned into a vector supplying both 
transcription and translation start signals suitable for the 
desired host cell . 
3 0 Variants may also (or alternatively) be modified by, for 

example, the deletion or addition of nucleotides that have 
minimal influence on the properties, secondary structure and 
hydropathic nature of the encoded polypeptide. For example, 
the nucleotides encoding a polypeptide may be conjugated to a 
35 signal (or leader) sequence at- the N-terminal end of the 
protein which co- translationally or post-translationally 
directs transfer of the protein. The nucleotide sequence may 
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also be altered so that the encoded polypeptide is conjugated 
to a linker or other sequence for ease of synthesis, 
purification, or identification of the polypeptide. 

The protein or polypeptide of the present invention is 
5 preferably produced in purified form (preferably, at least 
about 80%, more preferably 90%, pure) by conventional 
techniques. Typically, the protein or polypeptide of the 
present invention is isolated after lysing or sonication. 
After washing, the lysate pellet is resuspended in buffer 

10 containing Tris-HCl. During dialysis, a precipitate forms 

from this protein solution. The solution is centrifuged, and 
the pellet is washed and resuspended in the buffer containing 
Tris-HCl. Proteins are resolved by electrophoresis through an 
SDS 12% polyacrylamide gel, 

15 The DNA molecule encoding the grapevine leafroll virus 

protein or polypeptide of the present invention can be 
incorporated in cells using conventional recombinant DNA 
technology. Generally, this involves inserting the coding 
sequence into an expression system to which the DNA molecule 

20 is heterologous (i.e. not normally present) . The heterologous 
DNA molecule is inserted into the expression system or vector 
in proper sense orientation and correct reading frame. The 
vector contains the necessary elements for the transcription 
and translation of the inserted protein-coding sequences as 

25 well known in the art. 

U.S. Patent No. 4,237,224 (Cohen and Boyer) , hereby 
incorporated by reference, describes the production of 
expression systems in the form of recombinant plasmids using 
restriction enzyme cleavage and ligation with DNA ligase. 

30 These recombinant plasmids are then introduced, e.g., by 
transformation, and replicated in unicellular cultures 
including procaryotes and eucaryotic cells grown in culture. 

Recombinant genes may also be introduced into virus 
vectors, such as vaccinia virus. Recombinant viruses can be 

35 generated by transfection of- plasmids into cells infected with 
virus . 
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Suitable vectors include, but are noc limited to, the 
following viral vectors such as lambda vector system gtll, gt 
VES.tB, Charon 4, and plasmid vectors such as p3R322, pBR325, 
pACYC177 , pACYC184 , pUC8, pUCS , pUC18, pUC19, pLG33 9, pR29 0, 
5 pKC37, pKClOl, SV 40, pBluescript II SX +/- or KS +/- (see 

fitr-ataaeTiA Pinning fiy^pms Catalog (1993) from Stratagene, La 
Jolla, CA, hereby incorporated by reference), pQE, pIH821, 
pGEX, pET series (see studier et . al . , Gene Expression 
Technology , vol. 185 (1990), hereby incorporated by 

10 reference), and any derivatives thereof. Recombinant 

molecules can be introduced into cells via transformation, 
transduction, conjugation, mobilization, or electroporation. 
The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by' Maniatis et 

15 al . , l^m'pnilar rioninar & laboratory Manual, Cold Springs 
Laboratory, Cold Springs Harbor, New York (1982), which is 
hereby incorporated by reference. 

A variety of host -vector systems may be utilized to 
express the protein-encoding sequence (s). Primarily, the 

20 vector system must be compatible with the host cell used. 
Host -vector systems include but are not limited to the 
following: bacteria transformed with bacteriophage DNA, 
plasmid DNA, or cosmid DNA; microorganisms such as yeast 
containing yeast vectors; mammalian cell systems infected with 

25 virus (e.g., vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus (e.g., baculovirus) ; and plant 
cells infected by bacteria or transformed via particle 
bombardment (i.e. biolistics) . The expression elements of 
these vectors vary in their strength and specificities. 

30 Depending upon the host-vector system utilized, any one of a 
number of suitable and well known transcription and 
translation elements can be used. 

Different genetic signals and processing events control 
many levels of gene expression (e.g., transcription and 

35 messenger RNA (mRNA) translation) . Transcription of DNA is 

dependent upon the presence of a promotor, a DNA sequence that 
directs the binding of RNA polymerase and thereby promotes 
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mRNA synthesis. The DNA sequences of eucaryotic promotors 
differ from those of procaryotic promotors. Furthermore, 
eucaryotic promotors and accompanying genetic signals, 
including enhancer-like sequences and inducible regulatory 
5 sequences, may not be recognized in or may not function in a 
procaryotic system, and, further, procaryotic promotors are 
usually not recognized and do not function in eucaryotic 
cells. 

Similarly, translation of mRNA in procaryotes depends 

10 upon the presence of the proper procaryotic signals, which 
differ from those of eucaryotes. Efficient translation of 
mRNA in procaryotes requires a ribosome binding site (Shine- 
Dalgarno (SD) sequence) on the mRNA. This sequence is a short 
nucleotide sequence that is located before the start codon, 

15 usually AUG, which encodes the N- terminal methionine of the 

protein. The SD sequences are complementary to the 3 ' -end of 
the 16S rRNA (ribosomal RNA) and promote binding of mRNA to 
ribosomes by duplexing with rRNA to allow correct positioning 
of the ribosome. For a review on maximizing gene expression, 

20 see Roberts and Lauer, ypi-lmrifi in Flnzvmoloav, 68:473 (1979), 
which is hereby incorporated by reference. 

Promotors vary in their "strength" (i.e. their ability to 
promote transcription) . It is generally desirable to use 
strong promotors in order to obtain a high level of 

25 transcription and, hence, expression of the cloned gene of 

interest. Depending upon the host cell system utilized, any 
one of a number of suitable promotors may be used. For 
instance, when cloning in E. coli, its bacteriophages, or 
plasmids, promotors such as the T7 phage promoter, lac 

30 promoter, tip promoter, recA promotor, ribosomal RNA promotor, 
the P R and P L promotors of coliphage lambda and others, 
including but not limited, to lacUVS, ompF, bla, lpp, and the 
like, may be used to direct high levels of transcription of 
adjacent DNA segments. Additionally, a hybrid trp-lacUV5 

35 (tac) promotor or other E. cpli promotors produced by 

recombinant DNA or other synthetic DNA techniques may be used 
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to provide for transcription of the inserted coding sequence 
or other inserted nucleic acid. 

Bacterial host cell strains and expression vectors may b 
chosen which inhibit the action of the promotor unless 
specifically induced. In certain operons, the addition of 
specific inducers is necessary for efficient transcription of 
the inserted DNA. For example, the lac operon is induced by 
the addition of lactose or IPTG (isopropylthio-beta-D- 
galactoside) . A variety of other operons, such as tip, pro, 
etc., have different under regulatory mechanisms . 

Specific initiation signals are also required for 
efficient gene transcription and translation in procaryotic 
cells. These transcription and translation initiation signals 
may vary in "strength" as measured by the quantity of aene 
specific messenger RNA and protein synthesized, respectively. 
The DNA expression vector, which contains a promotor, may also 
contain any combination of various "strong" transcription 
and/or translation initiation signals. For instance, 
efficient translation in E. coli requires a SD sequence about 
7-9 bases 5' to the initiation codon (ATG) to provide a 
ribosome binding site. Thus, any SD-ATG combination that can 
be utilized by host cell ribosomes may be employed. Such 
combinations include, but are not limited to, the SD-ATG 
combination from the cro gene or the N gene of coliphage 
lambda, or from the E. coli trp E, D, C, B or A genes. 
Additionally, any SD-ATG combination produced by recombinant 
DNA or other techniques involving incorporation of synthetic 
nucleotides may be used. 

Once the isolated DNA molecules encoding the various 
grapevine leaf roll virus proteins or polypeptides, as 
described above, have be en cloned into an expression system, 
they are ready to be incorporated in a host cell . Such 
incorporation can be carried out by the various forms of 
transformation noted above, depending upon the vector/host 
cell system. Suitable host cells include, but are not limited 
to, bacteria, yeast, mammalian cells, insect, plant, and the 
like. 
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The present invention also relates to RNA molecules which 
encode the various grapevine leafroll virus proteins or 
polypeptides described above. The transcripts can be 
synthesized using the host cells of the present invention by 
5 any of the conventional techniques . The mRNA can be 

translated either In vitro or in vivo. Cell-free systems 
typically include wheat -germ or reticulocyte extracts. 

One aspect of the present invention involves using one or 
more of the above DNA molecules encoding the various proteins 

10 or polypeptides of a grapevine leafroll virus to transform 

grape plants in order to impart grapevine leafroll resistance 
to the plants. The mechanism by which resistance is imparted 
is not known. As hypothesized, the transformed plant can 
express the coat protein or polypeptide, and, when the 

15 transformed plant is inoculated by a grapevine leafroll virus, 
such as GLRaV-1, GLRaV-2, GLRav-3 , GLRaV-4, GLRaV-5, or 
GLRaV-6, or combinations of these, the recombinantly expressed 
coat protein or polypeptide surrounds the virus, thereby 
preventing translation of the viral DNA. 

20 In this aspect of the present invention the subject 

coding sequence incorporated in the plant can be 
constitutively expressed. Alternatively, expression can be 
regulated by a promoter which is activated by the presence of 
grapevine leafroll virus. Suitable promoters for these 

25 purposes include those from genes expressed in response to 
grapevine leafroll virus infiltration. Additional suitable 
plant promoters include those which induce downstream gene 
expression in response to wounding, in response to elicitors 
and in response to virus infection. In the alternative, a 

30 constitutive plant-expressible promoter can be used; it is 
preferred that the level of gene expression is sufficiently 
high to provide virus resistance but not so high as to be 
detrimental to the normal functioning of the cell and tissues 
in which it is expressed. Immediately upstream of the start 

3 5 of a coding sequence for a GLRaV-3 protein or polypeptide in 

an expression system (expression vector, for use in plants) it 
is desired that there be a Kozak consensus for translation 
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intiation (AAXXATGG, where X is any of the four nucleotides) . 
Downstream of the end of the coding sequence for the virus 
protein or polypeptide, it is preferred that there be a 
polyadenylation signal functional in plants, such as that from 
5 the nopaline synthase gene, the octopine synthase gene or from 
the CaMV 35S gene. These sequences are well known in the 
plant biotechnology art. 

The DNA coding sequences and/or molecules of the present 
invention can be utilized to impart grapevine leafroll 

10 resistance for a wide variety of grapevine plants. The DNA 

molecules are particularly well suited to imparting resistance 
to Vitis scion or roots tock cultivars. Scion cultivars which 
can be protected include those commonly referred to as Table 
or Raisin Grapes, such as Alden, Almeria, Anab-E-Shahi , Autumn 

15 Black, Beauty Seedless, Black Corinth, Black Damascus, Black 
Malvoisie, Black Prince, Blackrose, Bronx Seedless, Burgrave, 
Calmeria, Campbell Early, Canner, Cardinal, Catawba, 
Christmas, Concord, Dattier, Delight, Diamond, Dizmar, 
Duchess, Early Muscat, Emerald Seedless, Emperor, Exotic, 

20 Ferdinand de Lesseps, Fiesta, Flame seedless, Flame Tokay, 

Gasconade, Gold, Himrod, Hunisa, Hussiene, Isabella, Italia, 
July Muscat, Khandahar, Katta, Kourgane, Kishmishi, Loose 
Perlette, Malaga, Monukka, Muscat of Alexandria, Muscat Flame, 
Muscat Hamburg, New York Muscat, Niabell, Niagara, Olivette 

25 blanche, Ontario, Pierce, Queen, Red Malaga, Ribier, Rish 
Baba, Romulus, Ruby Seedless, Schuyler, Seneca, Suavis (IP 
3 65) , Thompson seedless,, and Thomuscat. They also include 
those used in wine production, such as Aleatico, Alicante 
Bouschet, Aligote, Alvarelhao, Aramon, Baco blanc (22A) , 

3 0 Burger, Cabernet franc, Cabernet, Sauvignon, Calzin, 

Carignane, Charbono, Chardonnay, Chasselas dore, Chenin blanc, 
Clair ette blanche, Early Burgundy, Emerald Riesling, Feher 
Szagos, Femao Pires, Flora, French Colombard, Fresia, 
Furmint, Gamay, Gewurztraminer , Grand noir, Gray Riesling, 

35 Green Hungarian, Green Veltliner,. Grenache, Grillo, Helena, 
Inzolia, Lagrein, Lambrusco de Salamino, Malbec, Malvasia 
bianca, Mataro, Melon, Merlot, Meunier, Mission, Montua de 
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Pilas, Muscadelle du Bordelais, Muscat blanc, Muscat Ottonel 
Muscat Saint-Vallier, Nebbiolo, Nebbiolo fino, Nebbiolo 
Lampia, Orange Muscat, Palomino, Pedro Ximenes, Petit 
Bouschet, Petite Sirah, Peverella, Pinot noir, Pinot Saint - 
George, Primitivo di Gioa, Red Veltliner, Refosco, Rkatsiteli 
Royalty, Rubired, Ruby Cabernet, Saint -Emilion, Saint Macaire' 
Salvador, Sangiovese, Sauvignon blanc, Sauvignon gris, 
Sauvignon vert, Scarlet, Seibel 5279, Seibel 9110, Seibel 
130S3, Semillon, Servant, Shiraz, Souzao, Sultana Crimson, 
Sylvaner, Tannat, Teroldico, -Tinta Madeira, Tinto cao, 
Touriga, Traminer, Trebbiano Toscano, Trousseau, Valdepenas, 
Viognier, Walschriesling, White Riesling, and Zinfandel. 
Rootstock cultivars which can be protected include Couderc 
1202, Couderc 1613, Couderc 1616, Couderc 3309, Dog Ridge, 
Foex 33 EM, Freedom, Ganz in 1 (A x R #l) , Harmony, Kober 5BB, 
LN33, Millardet & de Grasset 41B, Millardet & de Grasset 420A, 
Millardet & de Grasset 101-14, Oppenheim 4 (S04) , Paulsen 775, 
Paulsen 1045, Paulsen 1103, Richter 99,_Ricnter 110, Riparia 
Gloire, Ruggeri 225, Saint -George, Salt Creek, Teleki SA, 
Vitis rupestris Constantia, Vitis California, and Vitis 
girdiana.. 

There is extensive similarity in the hsp70-related 
sequence regions of GLRaV-3 and other closteroviruses , such as 
tristeza virus. Consequently, the GLRaV-3 hsp70-related gene 
can also be used to produce transgenic cultivars other than 
grape, such as. citrus, which are resistant to closteroviruses 
other than grapevine leaf roll, including tristeza virus. 
These include cultivars of lemon, lime, orange, grapefruit, 
pineapple, tangerine, and the like, such as Joppa, Maltaise 
Ovale, Parson (Parson Brown) , Pera, Pineapple, Queen, 
Shamouti, Valencia, Tenerife, Imperial Doblefina, Washington 
Sanguine, Moro, Sanguinello Moscato, Spanish Sanguinelli, 
Tarocco, Atwood, Australian, Bahia, Baiana, Cram, Dalmau, 
Eddy, Fisher, Frost Washington, Gillette, LengNavelina, 
Washington, Satsuma Mandarin, Dancy, Robinson, Ponkan, Duncan, 
Marsh, Pink Marsh, Ruby Red, Red Seedless, Smooth Seville, 
Orlando Tangelo, Eureka, Lisbon, Meyer Lemon', Rough Lemon, 
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Sour Orange, Persian Lime, West Indian Lime, Bearss, Sweet, 
Lime, Troyer Citrange, and Citrus trifoliata. 

Plant tissue suitable for transformation include leaf 
tissue, root tissue, meristems, zygotic and somatic embryos, 
5 and anthers. It is particularly preferred to utilize embryos 
obtained from anther cultures . 

The expression systems of the present invention can be 
used to transform virtually any plant tissue under suitable 
conditions . Tissue transformed in accordance with the present 

10 invention can be grown in vitro in a suitable medium to impart 
grapevine leaf roll virus resistance. Transformed cells can be 
regenerated into whole plants such that the protein or 
polypeptide imparts resistance to grapevine leafroll virus in 
the intact transgenic plants. In either case, the plant cells 

IS transformed with the recombinant DNA expression system of the 
present invention are grown and express one of the above- 
described grapevine leafroll virus proteins or polypeptides 
and, thus, grapevine leafroll resistance. 

One technique of transforming plants with the DNA 

20 molecules of the present invention is by contacting the tissue 
of such plants with an inoculum of a bacterium transformed 
with a vector comprising a gene of the present invention which 
imparts grapevine leafroll resistance- Generally, this 
procedure involves inoculating the plant tissue with a 

25 suspension of bacteria and incubating the tissue for 4 8 to 
72 hours on regeneration medium without antibiotics at 25- 
2B°C. Cells of the genus Acprobacteriwn can be used to 
transform plant cells and/or plant tissue. Suitable species 
include Agirobacteriwn txmtefaciens and Agrrobacterium 

30 rhlzogenes. A. tumefaciens (e.g., strains C58, LBA4404, or 
EHA105) is particularly useful due to its well-known ability 
to transform plants, plant tissue and plant cells. 

Another approach to transforming plant cells with a gene 
which imparts resistance to pathogens is particle bombardment 

35 (also known as biolistic transformation) of the host cell. 
This can be accomplished in one of several ways. This 
technique is disclosed in U.S. Patent Nos. 4,945,050, 
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5,036,006, and 5,100,792, all to Sanford et al . , and in 
Emerschad et al. f ^^r r P n ^nort.g. 14:6-12 (1995)), all 
hereby incorporated by reference- Generally, this procedure 
involves propelling inert or biologically active particles at 
5 the cells under conditions effective to penetrate the outer 

surface of the cell and to be incorporated within the interior 
thereof. When inert particles are utilized, the vector can be 
introduced into the cell by coating the particles with the 
vector containing the heterologous DNA. Alternatively, the 

10 target cell can be surrounded by the vector so that the vector 
is carried into the cell by the wake of the particle. 
Biologically active particles (e.g., dried bacterial cells 
containing the vector and heterologous DNA} can also be 
propelled into plant cells. 

15 Once grape plant tissue is transformed in accordance with 

the present invention, it is regenerated to form a transgenic 
grape plant. Generally, regeneration is accomplished by 
culturing transformed tissue on medium containing the 
appropriate growth regulators and nutrients to allow for the 

20 initiation of shoot meristems. Appropriate antibiotics are 
added to the regeneration medium to inhibit the growth of 
Agrobacterium and to select for the development of transformed 
cells. Following shoot initiation, shoots are allowed to 
develop in tissue culture and are screened for marker gene 

25 activity. 

The DNA molecules of the present invention can be 
transcribed into mRNA, which, although encoding a grapevine 
leaf roll virus protein or polypeptide, is not translated to 
the corresponding protein. This is known as RNA-mediated 

3 0 resistance. When a Vitis scion or rootstock cultivar is 

transformed with such a DNA molecule, the DNA molecule can be 
transcribed under conditions effective to maintain the mRNA in 
the plant cell at low level density readings. Density 
readings of between 15 and 50 using a Hewlet ScanJet and Image 

35 Analysis Program are preferred. 
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The grapevine leafroll virus protein or polypeptide can 
also be used to raise antibodies or binding portions thereof 
or probes. The antibodies can be monoclonal or polyclonal. 

Monoclonal antibody production may be effected by 
techniques which are well-known in the art. Basically, the 
process involves first obtaining immune cells (lymphocytes) 
from the spleen of a mammal (e.g., mouse) which has been 
previously immunized with the antigen of interest either in 
vivo or in vitro. The antibody- secreting lymphocytes are then 
fused with (mouse) myeloma cells or transformed cells, which 
are capable of replicating indefinitely in cell culture, 
thereby producing an immortal, immunoglobulin-secreting cell 
line. The resulting fused cells, or hybridomas, are cultured, 
and the resulting colonies screened for the production of the 
desired monoclonal antibodies. Colonies producing such 
antibodies are cloned, and grown either in vivo or in vitro to 
produce large quantities of antibody. A description of the 
theoretical basis and practical methodology of fusing such 
cells is set forth in Kohler and Milstein, Nature , 256:495 

(1975) , incorporated by reference. 

Mammalian lymphocytes are immunized by in vivo 
immunization of the animal (e.g., a mouse) with the protein or 
polypeptide of the present invention. Such immunizations are 
repeated as necessary at intervals of up to several weeks to 
obtain a sufficient titer of antibodies. Following the last 
antigen boost, the animals are sacrificed and spleen cells 
removed. 

Fusion with mammalian myeloma cells or other fusion 
partners capable of replicating indefinitely in cell culture 
is effected by standard and well-known techniques, for 
example, by using polyethylene glycol (PEG) or other fusing 
agents. (See Milstein and Kohler, Eur. J. Immunol . r 6:511 

(1976) , incorporated by reference.) This immortal cell line, 
preferably murine, but may also be derived from cells of other 
mammalian species, including but. not limited to rats and 
humans, is selected to be deficient in enzymes necessary for 
the utilization of certain nutrients, capable of rapid growth, 
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and having good fusion capability. Many such cell lines are 
known to those skilled in the art, and others are regularly 
described. 

Procedures for raising polyclonal antibodies are also 
5 well known. Typically, such antibodies can be raised by 
administering the protein or polypeptide of the present 
invention subcutaneously to New Zealand white rabbits which 
have first been bled to obtain pre -immune serum. The antigens 
can be injected at a total volume of 100 fil per site at six 

10 different sites. Each injected material will contain 

synthetic surfactant adjuvant pluronic polyols, or pulverized 
acrylamide gel containing the protein or polypeptide after 
SDS-polyacrylamide gel electrophoresis. The rabbits are then 
bled two weeks after the first injection and periodically 

IS boosted with the same antigen three times every six weeks. A 
sample of serum is then collected 10 days after each boost. 
Polyclonal antibodies are then recovered from the serum by 
affinity chromatography using the corresponding antigen to 
capture the antibody. Ultimately, the rabbits are euthanized 

20 with pentobarbital 150 mg/Kg IV. This and other procedures 

for raising polyclonal antibodies are disclosed in Harlow et . 

al., editors, Antibodies; A T,ahn^tory Manual (1988), which 

is hereby incorporated by reference. 

In addition to utilizing whole antibodies, binding 

25 portions of such antibodies can be used. Such binding 

portions include Fab fragments, F(ab') 2 fragments, and Fv 
fragments. These antibody fragments can be made by 
conventional procedures, such as proteolytic^ fragmentation 
procedures, as described in Goding, Monoclonal AnMborH pr * 

30 Principle find Practice, New YorkrAcademic Press, pp. 98-118 
(1983) , hereby incorporated by reference. 

The present invention also relates to probes found either 
in nature or prepared synthetically by recombinant DNA 
procedures or other biological procedures . Nucleic acid 

3 5 probes can also be synthesized by manual chemical synthesis 
(see, e.g., Beaucage and Caruthers (1981) Tetra . t t 
22:1859-1862; MAtteuci et al . (1981) Ar. . rh-m . Snr 
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103:3185) or by automated chemical synthesis using 
commercially available equipment (e.g., Applied Biosystems, 
Foster City, CA) . Suitable probes are molecules which bind to 
grapevine leafroll viral antigens identified by the monoclonal 
antibodies of the present invention. Such probes can be, for 
example, proteins, peptides, lectins, or nucleic acid probes. 

The antibodies or binding portions thereof or probes can 
be administered to grapevine leafroll virus infected scion 
cultivars or rootstock cultivars. Alternatively, at least the 
binding portions of these antibodies can be sequenced, and the 
encoding DNA synthesized. The encoding DNA molecule can be 
used to transform plants together with a promoter which causes 
expression of the encoded antibody or binding portion thereof 
when the plant is infected by grapevine leafroll virus. In 
either case, the antibody or binding portion thereof or probe 
will bind to the virus and help prevent the usual leafroll 
response. 

Antibodies raised against the proteins or polypeptides of 
the present invention or binding portions of these antibodies 
can be utilized in a method for detection of grapevine 
leafroll virus in a sample of tissue, such as tissue from a 
grape scion or rootstock. Antibodies or binding portions 
thereof suitable for use in the detection method include those 
raised against a helicase, an RNA-dependent RNA polymerase, an 
hsp70-related, an hsp90-related, or a coat protein or 
polypeptide in accordance with the present invention. Any 
reaction of the sample with the antibody is detected using a 
reporter or other assay system which indicates the presence of 
grapevine leafroll virus in the sample. A variety of assay 
systems can be employed, such as enzyme-linked immunosorbent 
assays, radioimmunoassays, gel diffusion precipitin reaction 
assays, immunodif fusion assays, agglutination assays, 
fluorescent immunoassays, protein A immunoassays, or 
Immunoelectrophoresis assays . 

Alternatively, grapevine leafroll virus can be detected 
in such a sample using a nucleotide sequence of the DNA 
molecule, or a fragment thereof, encoding for a protein or 
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polypeptide (or a portion thereof) of the present invention. 
The nucleotide sequence is provided as a probe in a nucleic 
acid hybridization assay or a specific gene amplification 
detection procedure (e.g., using a polymerase chain reaction 
5 procedure) . Any reaction with the probe is detected so that 
the presence of grapevine leaf roll virus in the sample is 
indicated. 

The following examples are provided to illustrate 
embodiments of the present invention but are by no means 
10 intended to limit its scope. References cited in the Examples 
are incorporated by reference herein. 

EXAMPLES 

Example 1 - Materials and Methods 

15 

Virua purification and dsRNA isolation. The NY1 isolate, 
also referred to as isolate GLRaV 109 by Golino, Amp-r . ,T , 
Enol . Vifir . 43:200-205 (1992), a member of GLRaV- 3 (Hu et 
al., J. Phvtopathol (B«»rl.) . 128:1-14 (1990)); Zee et al., 

20 Phytopathology . 77:1427-1434 (1987)) was used throughout this 
work. Leaf roll-diseased canes and mature leaves were 
collected from a vineyard in central New York State, and kept 
at -20°c until used. GLRaV- 3 virus particles were purified 
according to the method described by Zee (1987) , and modified 

25 later by Hu (1990) . After two cycles of Cs 2 SO< gradient 
purification, virus particles were observed from virus - 
enriched fractions by negative staining on an electron 
microscope . 

The dsRNA was extracted from scraped bark/phloem tissue 
30 of canes as described in Hu (1990) . Briefly, total nucleic 
acid was extracted with phenol /chloroform; dsRNA was absorbed 
on a CF-ll cellulose column under 17V ethanol and eluted 
without ethanol. After two cycles of ethanol precipitation, 
dsRNA was analyzed by electrophoresis on a 6V polyacrylamide 
3 5 or IV agarose gel. A high Mr dsRNA (-16 kb) along with 
several smaller Mr dsRKAs was consistently identified in 
leafroll diseased but not in healthy samples (Hu (1990) . The 
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16 kb dsRNA, which was presumably a replicative form of the 
virus, was purified further following separation on a low 
melting temperature-agarose gel (Sambrook et al. f Molonil 
Clonincr. A Laboratory Manual. 2nd grf f cold Spring Harbor 
Laboratory Press (1389) . The double -stranded nature of the 
dsRNA was confirmed by its resistance to DNase and RNase in 
high salt and sensitivity to RNase in water (Hu (1990) . 

cDNA synthesis and molecular cloning. Complementary DNA 
(cDNA) was prepared by the procedure of Gubler et al., Gen<* r 
25:263 (1983), and modified for dsRNA by Jelkmann et al., 
Phytopathology, 79:1250-1253 (1989) . Briefly, following 
denaturation of about 2 /zg of dsRNA in 20 mM methylmercuric 
hydroxide (MeHg) for 10 min, the first -strand cDNA was 
synthesized by avian myeloblastosis virus (AMV) -reverse 
■transcriptase using random primers (Boehringer Mannheim, 
Indianapolis, IN) . The second-strand cDNA was synthesized 
with DNA polymerase I while RNA templates were treated with 
RNase H. The cDNA was size- fractionated on a CL-4B Sepharose 
column and peak fractions, which contained larger molecular 
weight cDNA, were pooled and used for cloning. Complementary 
DNA ends were blunted with T4 DNA polymerase, and EcoRI 
adapters were ligated onto a portion of the blunt -ended cDNA. 
After treatment with T4 polynucleotide kinase and removal of 
unligated adapters by spin column chromatography, the cDNA was 
ligated with lambda ZAFII/EcoRI prepared arms (Stratagene, La 
Jolla, CA) . These recombinant DNAs were packaged in vitro 
with GIGAPACK II GOLD™ packaging extract according to the 
manufacturer's instruction (Stratagene). The packaged phage 
particles were used to infect bacteria, E. coli XLl-blue 
cells. 

Screening the cDNA library. To select GLRaV-3 dsRNA 
specific cDNA clones, probes were prepared from UNI -AMP™ 
(Clontech, Palo Alto, CA) PCR-amplif ied cDNA. PCR-amplif ied 
GLRaV-3 cDNA was labeled with J2 P [a-dATP] by Klenow fragment 
of E. coii DNA polymerase I with random primers and used as a 
probe for screening the library (Feinberg et al., Analytic 
SiPChem . , 132:6-13 (1983)). Library screening was carried out 
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by transferring plaques grown overnight onto GENESCREEN PLUS™ 
filters, following the manufacturer's instructions for 
denaturation, prehybridization, and hybridization (Dupont, 
Boston, MA) . After washing, an autoradiograph was developed 
5 after exposing Kodak X-OMAT film to the washed filters 

overnight at -80°C. Bacteriophage recombinants were converted 
into plasmids (in vivo excision) following the manufacturer's 
instruction. (Stratagene) . 

Identification of the coat protein gene was done by 

10 immunoscreening the cDNA library with GLRaV-3 specific 
polyclonal (Zee (1987)) and monoclonal (Hu (1990) ) , 
antibodies. Degenerate primer ( 5 ' GGNGGNGGNACNTTYGAYGTNTCN 
(SEQ. ID. No, 19) , I=inosine, Y=T or C) generated from a 
conserved amino acid sequence in Motif C of the BYV HSP70 gene 

15 (p65) was used to select HSP70 positive clones. Further 
sequence extension was made possible by the clone walking 
strategy, which used sequences that flanked the sequence to 
probe the library for a clone that contained an insert 
extending farther in either 5' or 3 1 direction. 

20 Northern blot hybridization. Inserts from selected 

clones were labeled with "P[a»dATPj by Klenow fragment of E. 
coli DNA polymerase I (Feinberg (1983)), and used as probes to 
test their specific reactions to dsRNAs isolated from leafroll 
infected tissues. Double-stranded RNA isolated from GLRaV-3 

25 infected vines was separated by electrophoresis on a IV 

agarose gel (nondenatured condition) , denatured with 50 mM 
NaOH, 0.6 M NaCl for 3 0 min at room temperature, and 
neutralized with 1.5 M NaCl, 0.5 M Tris-HCl, pH 7.5 for 
another 30 min. Denatured dsRNA was sandwich-blotted onto a 

30 GENESCREEN PLUS™ membrane. Prehybridization and hybridization 
were carried out in a manner similar to that described above. 
The membrane was washed and exposed to Kodak X-OMAT film, and 
an autoradiograph was developed. 

Identification of immunopositive clones. For 

35 immunoscreening, plates with plaques appearing after 8-12 h 
incubation at 37°C were overlaid with a 10 mM isopropyl-p-D- 
thio-galactopyranoside ( IPTG) 
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impregnated Nylon filters (GENESCREEN PLUS™) and incubated for 
an additional 3-4 h. After blocking with 3h bovine serum 
albumin <BSA) , the blotted filter was incubated in a 1:1000 
dilution of alkaline phosphatase-conjugated GLRaV-3 polyclonal 
5 antibody for 3 h at 37°C. Positive signals (purple dots) were 
developed by incubation of washed filters in a freshly 
prepared nitroblue tetrazolium (NBT) and 5-brorao-4-chloro-3- 
indolyl phosphate (BCIP) solution. To further confirm whether 
or not a true GLRaV-3 coat protein expression plaque was 

10 selected, a secondary immunoscreening was carried out by 
reinfection of bacterial XL1 Blue cells with an earlier 
selected plaque. 

Western blot analysis. After secondary immunoscreening, 
GLRaV-3 antibody positive plaques were converted into plasmid, 

15 the pBluescript, by in vivo excision. Single colonies were 
picked up and cultured in LB medium with 100 /xg/ml of 
arapicillin until mid-log growth. Fusion protein expression 
was induced by addition of 10 mM IPTG with an additional 3 h 
of incubation at 37°C. Bacteria was pelleted and denatured by 

20 boiling in protein denaturation buffer (Sambrook (1989)). An 
aliquot of 5 /il denatured sample was loaded and separated by 
electrophoresis on a 12V SDS-polyacrylamide gel along with a 
prestained protein molecular weight marker (Bio-Rad, Hercules, 
CA) . The separated proteins were transferred onto an 

25 Immobulon membrane (Millipore) with an electroblotting 
apparatus (Bio-Rad) . After blocking with 3* BSA, the 
transferred membrane was incubated with 1:1,000 dilution of 
either GLRaV-3 polyclonal or monoclonal antibody/alkaline 
phosphatase conjugate. A positive signal was developed after 

3 0 incubation of the washed membrane in NBT and BCIP. 

PCR analysis. To analyze a cloned insert, "an aliquot of 
a bacterial culture was used directly in PCR amplification 
with common vector primers (SK and KS) . PCR-araplif ied product 
was analyzed by electrophoresis on an agarose gel. 

3 5 Nucleotide sequencing and computer sequence analysis. 

Plasmid DNA, purified by either a CsCl method (Sambrook 
(1989) ) or a modified mini alkaline-lysis/PEG Drecizsitation 
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procedure (Applied Biosystems- Instruction), was sequenced 
either with Sequenase version 2 kit following the 
manufacturer's instruction (US Biochemical, Cleveland, Ohio) 
or with Tag DYEDSOXY™ terminator cycle sequencing kit (Applied 
Biosystems, Inc.). Automated sequencing was conducted on an 
ABI373 automated sequencer. 

Nucleotide sequences were analyzed using a Genetics 
Computer Group (GCG) sequence analysis software package. 
(Madison, WI) . Sequence fragments were assembled using 
Newgelstart to initiate the GCG fragment assembly system and 
to support automated fragment assembly in GCG Version 7 2 

Computer-assisted analysis of phylogenetic relationships 
Amino acid sequences were either obtained from database Swiss " 
Prot or translated from nucleotide sequences obtained from 
GenBank. • A phylogenetic tree depicting a predicted 
relationship in the evolution of the GLRaV-3 coat protein 
sequence with those of other filamentous plant viruses was 
generated using the Clustal Method of the DNASTAR 1 s MegAlign 
program (Madison, WI, . with the Clustal method, a preliminary 
phylogeny is derived from the distances between pairs of input 
sequences and the application of the UPGMA algorithm (Sneath 
et al " Numeric Tpxnnnmv - ^ p^^ r i. B r nr , t>~,- r ,- rr n f 
mm~-<r^ T ffrnnfW r Freeman Press (1973)), which guides the 
alignment of ancestral sequences. The final phylogeny is 
produced by applying the neighborhood joining method of Saitou 
et al., mot . t, 1n i ffyn1 t 4:406-425 (1987), to the distance 
and alignment data. 

Nucleotide sequence and primer selection. The sequence 
rragment (Table IS, selected for PCR has now been identified 
to be from nucleotides 9364 to 10,011 of the incomolete GLRaV- 
3 genome (Table 4) . This sequence region encodes a short 
peptide which shares sequence similarity to HSP90 homologues 
of otner enteroviruses (Figure 1) . Selected primers and 
their designations are shown in Table 15, which shows the 
nucleotide and amino acid sequences of a PCR amplified 
fragment of the GLRaV-3 genome. "The external and internal 
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primers used for PGR are underlined and their orientations are 
indicated by arrows. 

Sample preparation. These include 1) dsRNA, 2) purified 
virus, 3) partially purified virus, 4) proteinase K treated 
crude extract, and 5) immuno- capture preparation. 

Isolation of dsRNA from leafroll infected grapevine 
tissues followed the procedure developed by Hu (1990). 

Virus purification was effected by the following 
procedure. An aliquot of 500 /il GLRaV-3 -enriched fractions 
after two cycles of Cs 2 S0 4 gradient was diluted with two 
volumes of TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) and 
incubated on ice for 5 min. The reaction was then adjusted to 
a final concentration of 200 mM NaAc, pH 5.0 # C.5V SDS, and 
200 fig/ml proteinase K and incubated at 37 °C for 3 h. Viral 
RNA was extracted with phenol and chloroform, ethanol- 
precipitated, and resuspended in 50 pel of diethyl 
pyrocarbonate (DEPC) -treated H 2 0. For each 100 pi pcr reaction 
mixture, 1 fil of purified viral RNA was used as template. 

Partially purified virus was prepared according to the 
virus purification procedure described in Hu (1990), but only 
to the high speed centrif ugation (27,000 rpra, 2 h) step 
without further Cs 2 S0 4 gradient centrif ugation. The pellet was 
resuspended in TE buffer and subjected to proteinase K 
treatment as described above. Viral RNA was extracted with 
phenol /chloroform and precipitated using ethanol. From 10 g 
of starting material, the pellet was resuspended in 200 /il of 
DEPC treated H 2 0. A 1 /zl aliquot of extracted RNA or its 10- 
fold dilution series (up to 10* 5 ) was used for reverse 
transcription- PCR (RT-PCR) . 

Crude extract was treated with Proteinase K as follows. 
Liquid nitrogen powdered grapevine bark/phloem tissue (100 mg) 
was macerated in 1 ml of virus extraction buffer (0.5 M Tris- 
HC1, pH 9.0, 0.01 M MgS0 4 , 4 V water insoluble polyvinyl 
pyrrolidone (PVP40) , 0.5V bentonite, 0.2% 2-mercaptoethanol, 
and 5V Triton X-100) (Zee (1987)) . After a brief 
centrifugation (5,000 rpm, 2 min), 500 /xl of supernatant was 
transferred into a new tube, adjusted to 100 /xg/ml proteinase 
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K, and incubated for 1 h at 55°C (Kawasaki, "Sample 
Preparation from Blood, Cells, and Other Fluids," in Innis et 
al., eds, PGR Protocols: A Gnide to Method* an H R rr 1 icat-^ nff 
Academic Press, Inc. (1990)). Following incubation, the 
preparation was boiled for 10 min to inactivate proteinase K 
and to denature the viral RNA. The upper clear phase was 
transferred into a new tube after a brief centrifugation. The 
viral RNA was precipitated with ethanol and resuspended in loo 
Ml of DEPC- treated H 2 0. An aliquot of 1 fil proteinase K- 
treated crude extract or its 10-fold dilution series (up to 
10" fi ) was used. 

The immuno- capture procedure was adapted from the method 
described by Wetzel at al., J. Virol. Meth. 39:27-37 (1992)). 
A 0.5 ml thin wall PCR tube was coated directly with 100 /xl of 
10 /xg/ml purified gamma-globulin from GLRaV-3 antiserum (Zee 
(1987)) in ELISA coating buffer (15 mM Na 2 C0 3 , 35 mM NaHC0 3/ pH 
9.6, and 0.02% NaN 3 ) and incubated for 4 h at 30°C. After 
washing 3 times with PBS-Tween-20, the antibody coated tube 
was loaded with 100 /zl of crude extract (1:10 or its 10-fold 
dilution series, up to 10"*) prepared in ELISA extraction 
buffer (50 mM sodium citrate, pH 8.3, 20 mM sodium 
diethyldithiocarbonate (DIECA) , 2% PVP 40K) and incubated at 
30 °C for 4 h. After washing, a 25 /il aliquot of transfer 
buffer (10 mM Tris, pH 8.0, IV Triton X-100) was added to the 
tube and vortexed thoroughly to release viral RNA. 

RT-PCR. Initially, reverse transcription (RT) and 
polymerase chain reaction (PCR) were performed in two separate 
reactions. An aliquot of 20 /il of reverse transcription 
reaction mixture was prepared to contain 2 /xl of 10X PCR 
buffer (Promega, Madison, WI) (10 mM Tris-HCl, pH 8*. 3, 500 mM - 
KC1, and 0.01* gelatin), 50 mM MgCl 3 , 2 /xl of 10 mM dNTP, 150 J 
ng of 5 1 and 3 1 primers, 16 units of RNasin, 25 units of avian 
myeloblastosis virus (AMV) reverse transcriptase, and 1 /xl of 
a denatured sample preparation. The reverse transcription 
reaction was carried out at 37°C for 3 0 min. After 
denaturation by heating at 95 °C for 5 min, an aliquot of PCR 
reaction mixture was added. This PCR reaction mixture (80 /xl) 
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contained 8 pi of 10X PCR buffer (Promega) , ISO mM MgCl 2 , 250 
ng of each 5 1 and 3' primer, 1 /zl of 10 mM dNTP, and 2.5 units 
of Taq DNA polymerase. The thermal cycling program was set as 
follows: a precycle at 92°C for 3 min; followed by 35 cycles 
of denaturation at 92°C, 1 min; annealing at 50°C, 1 min; and 
extension at 72°C, 2.5 min. The final extension cycle was set 
at 72°C for 5 min. 

Because reverse transcriptase functions in the PCR buffer 
system, RT and PCR can be combined (RT-PCR) in a single 
reaction (Ali et al., Biotftchni miss, 15:40-42 (1993); Goblet 
et al., Nnrlejc Acids Rfi^arrh. 17:2144 (1989)). The RT-PCR 
reaction mixture of 100 /il contains 10 /il of 10X PCR 
amplification buffer (Promega), 200 mM MgCl 2 , 250 ng each of 
primers, 3 fil of 10 mM dNTPs, 40 units of RNasin, 25 units of 
AMV or mo loney- murine leukemia virus (M-MLV) reverse 
transcriptase, 2.5 units of Taq DNA polymerase, and 1 /il of 
denatured sample preparation. The thermal cycling program was 
set as follows: one cycle of cDNA synthesis step at 37°C for 
3 0 min, immediately followed by PCR cycling as described 
above . 

Nested PCR. Inconsistent results obtained from a single 
round of PCR amplification prompted an investigation intq the 
feasibility of Nested PCR, Initial PCR amplification was 
performed with an external primer set (93-110 & 92-98) (Table 
15) . A PCR product of 648 bp was consistently observed from 
dsRNA as template, but the expected PCR product was not 
consistently observed in samples prepared from proteinase En- 
treated crude extract or immuno- capture sample preparation. 
Consequently, additional PCR amplification with an internal 
primer set (93-25 & 93-40) was carried out by adding 5 fil of 
the first external primer-amplified PCR product into a freshly 
prepared 100 pi PCR reaction mixture. The PCR cycling 
parameters were as described above. 

Example ? - Vin^ PnrifiggM'on and riaRNA Isolation. 

GLRaV-3 virus particles were purified directly from field 
collected samples of infected grapevines. Attempts to use 
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genomic RNA for cDNA cloning failed due to low yield of virus 
particles with only partial purity. However, virus particles 
were shown to be decorated by GLRaV-3 antibody using electron 
microscopy. The estimated coat protein molecular weight of 
41K agreed with an earlier study (Hu (1990) . Because of low 
yield in virus purification, dsRNA isolation was further 
pursued. Based on the assumption that high Mr dsRNA (16 kb) 
is the replicative form of the GLRaV-3 genomic RNA, this high 
Mr dsRNA was separated from other smaller ones by 
electrophoresis (Figure 2) , purified from a low melting 
temperature agarose gel, and used for cDNA synthesis. 

Example 3 - nPNA Svnthpgis. Molecular Cloning, and Analyst of 
cDNA Clones. 

First-strand cDNA was synthesized with AMV reverse 
transcriptase using purified 16 kb dsRNA which had been 
denatured with 10 mM MeHg as template. Only random primers 
were used to prime the denatured dsRNA because several other 
closteroviruses (BYV, CTV, and LIYV) have been shown to have 
no polyadenylated tail on the 3 1 end (Agranovsky et al., jL_ 
Gen, virol . . 72:15-24 (1991)}; Agranovsky et al., Virology, 
198:311-324 (1994); Karasev et al., Virology, 208:511-520 
(1995); Klaassen et al., Virology , 208:99-110 (1995); Pappu et 
al., Virology . 199:35-46 (1994)). After second-strand cDNA 
synthesis, the cDNA was size-fractionated on a CL-4B Sepharose 
column, and peak fractions which contained larger molecular 
weight cDNA were pooled and used for cloning. An 
autoradiograph of this pooled cDNA revealed cDNA molecules up 
to 4 kb in size. 

A lambda ZAPII library was prepared from cDNA that was 
synthesized with random primed, reverse transcription of 
GLRaV-3 specific dsRNA. Initially, white/blue color selection 
in IPTG/X-gal containing plates was used to estimate the ratio 
of recombination. There were 15.7% white plaques, and an 
estimated 7 X 10 4 GLRaV-3 specific recombinants in this cDNA 
library. The library was screened with probes prepared from 
UNI -AMP™ PCR-amplif ied GLRaV-3 cDNA. More than 3 00 clones 
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with inserts of up to 3 kb were selected after screening the 
cDNA library with probe prepared from UNI -AMP™ PCR-amplif ied 
GLRaV-3 cDNA. In Northern blot hybridization, a probe 
prepared from a clone insert, pC4 , reacted strongly to the is 
kb dsRNA as well as to several other smaller Mr dsRNAs. Such 
a reaction was not observed with nucleic acids from healthy 
grape or with dsRNA of CTV (Figure 1) . 

ExaTTOlff * - Selection and Characterizatio n nf TmryT^nnpoQif-iy ^ 

A total of 6 X 10* plaques were immunoscreened with GLRaV- 
3 specific polyclonal antibody. Three cDNA clones, designated 
pCPS, pCFB-4, and pCP10-l, produced proteins that reacted to 
the polyclonal antibody to GLRaV-3 . GLRaV-3 antibody 
specificity of the clones was further confirmed by their 
reaction to GLRaV-3 monoclonal antibody. PCR analysis of 
cloned inserts showed that a similar size of PCR product (1.0- 
1.1 kb) was cloned in each immunopositive clone using primers 
corresponding to flanking vector seuqences (SK and KS) . 
However, various sizes of antibody-reacting protein were 
produced from these clones, which suggested that individual 
clones were independent and contained different segments of 
the coat protein gene. The Mr of immunopositive fusion 
protein from clone pCP10-l was estimated to be 50K in SDS- 
PAGE, which was greater than the native coat protein of 41K 
(compare lanes 1 to 4 in Figure 3). Immunopositive proteins 
produced in clone pCP5 (Figure 3, lane 2) and pCP8 (Figure 3, 
lane 3) were different in size and smaller than the native 
coat protein. Clone pCP5 produced a GLRaV-3 antibody- reacting 
protein of 23K. Clone pCP8-4, however, produced an antibody- 
reacted protein of 27K. Similar banding patterns were 
observed when either polyclonal (Figure 3A) or monoclonal 
(Figure 3B) antibodies were used in Western blots. These 
results indicated that these cDNA clones contained coding 
sequences for the GLRaV-3 coat protein gene. 
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E x a iTTOlft «j - Nucleotide Sequencing and Trf» n H f i raM' on of ffrp 
Coat P--nt-P-in ff^ne 

Both strands of the three immunopositive clones were 
sequenced at least twice. A multiple sequence alignment of 
these three clones overlapped and contained an incomplete ORF 
lacking the 3' terminal sequence region. The complete 
sequence of this ORF was obtained by sequencing an additional 
clone, pA6-8, which was selected using the clone walking 
strategy. The complete ORF potentially encoded a protein of 
313 amino acids with a calculated Mr of 34,866 (p35) (Figure 4 
and Tables 2-3). Table 2 shows the nucleotide and amino acid 
sequences of the coat protein gene of grapevine leafroll 
associated closterovirus-3 , isolate NY1. Nucleotide 
sequencing was conducted by the procedure described in 
Example 1. The translated amino acid sequence is shown below 
the nucleotide sequence. Table 3 compares the alignment of 
the coat protein of GLRaV-3 with respect to BYV, CTV, and 
LIYV. Consensus amino acid residues are shown. Uppercase 
letters indicate identical amino acids, and lowercase letters 
indicate at least three identical or functionally similar 
amino acids. The three conserved amino acid residues (S, R, 
and D) identified in all filamentous plant virus coat proteins 
are in bold (Dolja et al., Vi ro 1 ogy. 184:79-86 (1991)). 

Because this ORF was derived from three independent 
clones after screening with GLRaV-3 coat protein specific 
antibody, it was identified as the coat protein gene of GLRaV- 
3. A multiple amino acid sequence alignment of p35 with the 
coat proteins of other closteroviruses, including BYV, CTV, 
and LIYV, is presented in Table 3. The typical consensus 
amino acid residues (S, R, and D) of the coat proteins of the 
filamentous plant viruses (Dolja et al., Virology., 184:79-86 
(1991) ) , which may be involved in salt bridge formation and 
the proper folding of the most conserved core region (Boyko et 
a1 -' Prac. Natl. Acad, Sri n,gR f 89:9156-9160 (1992)), were 
also preserved in the p35. Phylogenetic analysis of the 
GLRaV-3 coat protein amino acid sequence with respect to the 
other filamentous plant viruses placed GLRaV-3 into a separate 
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but closely related branch of the closterovirus (Figure 5) . 
Direct sequence comparison of GLRaV-3 coat protein with 
respect to other closterovirus coat proteins or their diverged 
copies by the GCG Pileup program demonstrated that at the 
nucleotide level, GLRaV-3 had its highest homology to BYV 
(41.5%) and CTV (40.3%). At the amino acid level, however, 
the highest percentage similarity were to the diverged copies 
of coat protein, with 23.5% identity (46.5% similarity) to CTV 
p26 and 22.6% (44.3% similarity) to BYV p24 . 



Rvample ff - TAent i f i rat ion of a O n at. Protein Translaf-inn 
Init iat i ort Site 

Various sizes of GLRaV-3 specific antibody-reactive 
proteins were produced by three immunopositive clones in E. 

15 coli (Figure 3) . Sequences of these clones overlapped and 
represented a common ORF that was identified as the coat 
protein gene (Figure 4) . In searching for possible 
translation regulatory elements, sequence analysis beyond the 
coat protein coding region revealed a purine rich sequence, - 

20 uG&GuGAAcgcgAES- (SEQ ID NO:26) , which was similar to the 

Shine -Dalgarno sequence (uppercase letters) (Shine et al., 
Pmr. Natl Ararf. fiei . USA . 71:1342-1346 (1974),), upstream 
from the coat protein initiation site (AUG) . This purine 
rich sequence can serve as an alternative ribosome entry site 

25 for the translation of the GLRaV-3 coat protein gene in E. 

coli. If this first AUG in the ORF serves for coat protein 
translation, the ribosomal entry site must be located in this 
purine rich region because an in- frame translation stop codon 
(UGA) was only nine nucleotides upstream from the coat protein 

30 gene translation initiation site (AUG) . Analysis of 

nucleotide sequence beyond the cloned insert into the vector 
sequence of clone pCP8-4 and pCP10-l provided direct evidence 
that the fusion protein was made from the N-terminal portion 
of coat protein and C- terminal portion of 0-galactosidase 

35 (16.5KJ. Further analysis of sequence around the selected AUG 
initiation codon of the coat protein gene revealed a consensus 
sequence 
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(-GnnADSG-) that favored the expression of eucaryotic mRNAs 
(Kozak, Mirrohiolocnral Reviews , 47:1-45 (1983); Kozak, £fill, 
44:283-292 (1986) ) . 

Nucleotide sequence analysis of three immunopositive 
5 clones revealed overlapping sequences and an ORF that covers 
about 96% of the estimated coat protein gene (Figure 4) . The 
complete ORF was obtained after sequencing of an additional 
clone (pA6-8) that was selected by the clone walking strategy. 
Identification of this ORF as the coat protein gene was based 

10 upon its immunoreactivity to GLRaV-3 polyclonal and monoclonal 
antibodies, the presence of filamentous virus coat protein 
consensus amino acid residues (S, R, and D) , and the 
identification of a potential translation initiation site. 
The calculated coat protein molecular weight (35K) is smaller 

15 than what was estimated on SDS-PAGE (4 IK) . This discrepancy 
in molecular weight between computer-calculated and SDS-PAGE 
estimated falls in the expected range. 

The estimated coat protein Mr of GLRaV-3 and another 
grape closterovirus-like designated GLRaV-1 are larger than 

20 the 22-2BK coat protein range reported for other well 

characterized closteroviruses such as BYV, CTV, and LIYV 
(Agranovsky (1991); Bar-Joseph et al., "Closteroviruses, » 
cmt/AAQ . No. 260 (1982), Klaassen et al., > T . fifln , Virol , , 
75:1525-1533 (1994); (Martelli et al . ( "Closterovirus , 

25 Classification and Nomenclature of Viruses, Fifth Report of 
the International Committee on Taxonomy of Viruses," in 
Arrhivps of Vi rolnoy Sup pl ffnpnr.iim 2, Martelli et al . , eds . , 
New York: Springer-Verlag Wein, pp. 345-347 (1991); Sekiya et 
al. f .t r^n Virol. . 72:1013-1020 (1991)). Hu (1990) 

30 suggested a possible coat protein dimer. The present sequence 
data, however, do not support this suggestion. First, the 
size of the coat protein is 35K, which is smaller than what 
would be expected of a coat protein dimer. Second, a multiple 
sequence alignment of N-terminal half and C-terminal half of 

35 GLRaV-3 coat protein with the coat proteins of other 

closteroviruses showed that the filamentous virus coat protein 
consensus amino acid residues (S, R, and D) are only present 
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in the C-terminal portion, but not in the N-terminal portion 
of the coat protein. 

Example 7 - Primpr fipWf"^ nn 

Primers were selected based on the nucleotide sequence of 
clone pC4 which had been shown to hybridize to GLRaV-3 dsRNAs 
on a Northern hybridization (Figure 1) . The 648 bp sequence 
amplified by PCR was identified as nucleotides 9,364 to 10,011 
of the incomplete GLRaV-3 genome (Table 4). This sequence' " 
fragment encodes a short peptide which shows some degree of 
amino acid sequence similarity to heat shock protein 90 
(HSP90) homologues of other closteroviruses, BYV, CTV, and 
LIYV (Table 5) . Two sets of primer sequences and their 
designations (external, 93-110 t 92-98, and internal, 93-25 & 
93-40) are shown in Table 15. Effectiveness of synthesized 
primers to amplify the expected PCR product was first 
evaluated on its respective cDNA clone, pC4 (Figure 6, lane 
11) . 



Ex a mple 8 - Development of a simple and Bf^f-j Y p ppp s amr ^ r 

Prepa-raf j nn , 

Initially purified dsRNA was used in a RT-PCR reaction. 
The expected 219 bp PCR product was consistently observed with 
the internal set of primers (Figure 6, lane 10) . To test 
whether or not these primers derived from GLRaV-3 specific 
dsRNA sequence is in fact the GLRaV-3 genome sequence, RNA 
extracted from a highly purified virus preparation was 
included in an assay. As expected, PCR products with similar 
size (219 bp) were observed in cloned plasmid DNA (pC4) 
(Figure 6, lane 11), dsRNA (Figure 6, lane 10) as well as 
purified viral RNA (Figure 6, lane 9) . This PCR result was 
the first evidence that dsRNA isolated from leafroll-infected 
tissue was derived from the GLRaV-3 genome. However, PCR 
sample preparations from the purified virus procedure are too 
complicated to be used for leaf roll diagnosis. Simplification 
sample preparations used viral RNA extracted from a partially 
purified virus preparation. This partially purified virus 
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preparation was again shown to be effective in RT-PCR (Figure 
G) . Sensitivity of RT-PCR was further evaluated with 10 -fold 
serial dilution (up to 1CT 5 > of a sample. The expected PCR 
product of 219 bp in a partially purified virus preparation 
5 was observable up to the ICT 3 dilution (Figure 6, lane 4) . 
Although RT-PCR was shown again to work with partially 
purified virus preparations, this method of sample preparation 
was still too complicated to be used in a routine disease 
diagnosis. Over 10 attempts to directly use crude extract for 
10 RT-PCR were unsuccessful. Proteinase K- treated crude extract 
was by far the most simple and still effective pretreatment 
for RT-PCR. Therefore, the proteinase K- treated crude extract 
was used to evaluate RT-PCR for its ability to detect GLRaV-3. 

15 Example 9 - RT-PCR 

With proteinase K- treated crude extract prepared from 
scraped phloem tissue collected from a typical leafroll 
infected vine (Doolittle's vineyard, New York), a PCR product 
of 219 bp was readily observed. However, application of this 

20 sample preparation method to other field collected samples 
(USDA, PGRU, Geneva, NY) was disappointing. With different 
batches of sample preparations, a range of 3 to 10 out of 12 
ELISA positive samples were shown to have the expected PCR 
products. To determine whether these inconsistent results 

25 were due to some kind of enzyme (reverse transcriptase or Taq 
DNA polymerase) inhibition present in the proteinase K-treated 
crude extract, increasing amounts of a sample were added into 
an aliquot of 100 /il PCR reaction mixture. PCR products of 
219 bp were readily observed from samples of 0.1 fil (lane 1) 

30 and 1 ptl (lane 2) but not from 10 pi. Presumably, sufficient 
amount of enzyme inhibitors was present in the .10 /il of this 
sample . 

Example 10 - Tiwimino-canMirP HT-PCR 
35 The immuno- capture method further simplified sample 

preparation by directly using crude extracts that were 
prepared in the standard ELISA extraction buffer. Immuno- 
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capture RT-PCR (IC RT-PCR) tests were initially performed with 
the internal primer set, and the expected PCR product of 219 
bp was observed from a typical leaf roll infected sample . 
However, this PCR method to test a range of field collected 
5 ELISA positive samples gave inconsistent results. In a PCR 

test performed with the external primer set, only five out of 
seven field collected ELISA positive samples were shown to 
amplify the expected PCR product (648 bp) (Figure 7A) . 
Meanwhile, the expected PCR product was consistently observed 
10 in dsRNA (Figure 7A, lane 10) , but such product was never 

observed in the healthy control (Figure 7A, lane 9) . In this 
case, however, the expected PCR product was not observed in a 
sample prepared using proteinase K-treated crude, extract 
(Figure 7A, lane 8) . 

15 

Pvamplfi 11 - Nftfif.ftd PCR 

As described above, inconsistency of RT-PCR was 
experienced with samples prepared either by the proteinase K- 
treated or by the imirtuno -capture methods. If this PCR 

20 technique is to be used in disease diagnosis, a consistent and 
reproducible result is needed. Thus, the Nested PCR method 
was introduced. Although an expected PCR product of €4 8 bp 
from the first PCR amplification with the external primer set 
was not always observable (Figure 7A) , in a Nested PCR 

25 amplification with the internal primer set, the expected 219 
bp PCR product was consistently observed from all seven ELISA 
positive samples (Figure 7B) . These products were observed in 
dsRNA (Figure 7B, lane 10) and in the proteinase K-treated 
crude extract (Figure 7B, lane 8) but not in a healthy control 

30 (Figure 7B, lane 9) . To determine the sensitivity of Nested 

PCR with samples prepared either by proteinase K-treated or by 
immuno- capture methods, Nested PCR and ELISA were performed 
simultaneously with samples prepared from a 10-fold dilution 
series. The sensitivity of Nested PCR was shown to be 10" s in 

35 proteinase K-treated crude extract (Figure 8A) , and was more 
than 10"* (the highest dilution point in this test) in an 
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immuno- capture preparation (Figure 8B) . With similar sample 
preparations, sensitivity for ELISA was only 10* 2 . 

Example 12 - Va lidation of PCR with ELTSA and indexing 

To determine whether the PCR-based GLRaV-3 detection 
method described in this study has a practical application in 
grapevine leaf roll disease diagnosis, a validation experiment 
with plants characterized thoroughly by ELISA and indexing is 
necessary. Several grapevines collected at USDA-PGRU at 
Geneva, New York, which have been well characterized by 3 -year 
biological indexing and by ELISA were selected for validation 
tests . A perfect correlation was observed between ELISA 
positive and PCR positive samples, although there was some 
discrepancy over indexing which suggested that other types of 
closteroviruses may also be involved in the grapevine leafroll 
disease (Table 7) . 

PCR technology has been applied to detect viruses, 
viroids and phytoplasmas in the field of plant pathology (Levy 
et al., Journal of Viroloc n'ral Methods. 49:295-304 (1994)). 
Because of the presence of enzyme inhibitors (reverse 
transcriptase and/or Taq DNA polymerase) in many plant 
tissues, a lengthy and complicated procedure is usually 
required to prepare a sample for PCR. In studies of PCR 
detection of grapevine fanleaf virus, Rowhani et al. , 
Phytopathol ngy . 83:749-753 (1993), have already observed an 
enzyme inhibitory phenomenon. Substances including phenolic 
compounds and polysaccharides in grapevine tissues were 
suggested to be involved in enzyme inhibition. 

One of the objectives in the present study was to develop 
a sound practical procedure for sample preparation to 
eliminate this inhibitory problem for PCR detection of GLRaV-3 
in grapevine tissues. Although the expected PCR product was 
consistently observed from samples of dsRNA, purified vims 
and partial purified virus, proteinase K-treated crude extract 
and immuno -capture methods were the simplest and were still 
effective. Samples prepared with proteinase K-treated crude 
extract have an advantage over others in that hazardous 
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organic solvents, such as phenol and chloroform, are avoided. 
However/ care must be taken in the sample concentration 
because the reaction can be inhibited by adding too much 
grapevine tissue. Minafra et al., J . Virol . MftthnHs. 47; 175- 
188 (1994) , reported the successful PCR detection of grapevine 
virus A, grapevine virus B, and GLRaV-3 with crude saps 
prepared from infected grapevine tissues, this method of 
sample preparation was, however, not effective in the present 
study. The similar primers used by Minafra (1994) , were, 
however, able to amplify the expected size of PCR products 
from dsRNA of the NY1 isolate of GLRaV-3. 

Immuno- capture is another simple and efficient method of 
sample preparation (Wetzel (1992) , which is hereby 
incorporated by reference) . First, crude ELISA extracts can 
be used directly for RT-PCR. Second, it provides not only a 
definitive answer, but may also be an indication to a virus 
serotype. Third, with an immuno -capture step, virus particles 
are trapped by an antibody, and inhibitory substances may be 
washed away. Nested PCR with samples prepared by the immuno- 
capture method is 10 3 times more sensitive than with samples 
prepared by proteinase K- treated crude extract. However, this 
approach requires a virus specific antibody. For some newly 
discovered or hard to purify viruses, a virus specific 
antibody might not be available. There are at least six 
serologically distinctive closteroviruses associated with 
grapevine leaf roll disease (Boscia (1995)). 

ExampIP 13 - Nucleotide Sequence and Open Reading Frames 

A lambda ZAPII library was prepared from cDNA that was 
synthesized with random primed, reverse transcription of 
GLRaV-3 specific dsRNA. Initially, white/blue color selection 
in IPTG/X-gal containing plates was used to estimate the ratio 
of recombination. There were 15.7^ white plaques, or an 
estimated 7 X 10 A GLRaV-3 specific recombinants in this cDNA 
library. The library was screened with probes prepared from 
UNI -AMP™ PCR-amplified GLRaV-3 cDNA. More than 300 clones 
with inserts of up to 3 kb were selected after screening the 
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cDNA library with probe prepared from UNI -AMP™ PCR-amplif ied 
GLRaV-3 cDNA. In Northern blot hybridization, a probe 
prepared from the cloned insert of pC4, reacted strongly to 
the 16 kb dsRNA as well as to several other smaller Mr dsRNAs. 
No hybridization with nucleic acids from healthy grape or to 
dsRNA of CTV was observed (Figure 1) . 

Clone pB3-l was selected and sequenced after screening 
the library with HSP70 degenerate primer 

(5 ' GGIGGIGGIACITTYGAYGTITCI (SEQ ID NO:2S)). Other clones 
that were chosen for nucleotide sequencing were selected by 
the clone walking strategy. The nucleotide sequencing 
strategy employed was based on terminal sequencing of random 
selected clones assisted with GCG fragment assembly program to 
assemble and extend the sequence. The step-by-step primer 
extensioii method was used to sequence the internal region of a 
selected clone. A total of 54 clones were selected for 
sequencing. Among them, 16 clones were completely sequenced 
on both DNA strands (Figure 9) . 

A total of 15/227 nucleotides were sequenced; nine open 
reading frames (ORFs) were identified designated as ORFs la, 
lb, and 2 to 8. The sequenced region was estimated to cover 
about 80% of the complete GLRaV-3 genome. Major genetic 
components, such as helicase (ORF la), RdRp (ORF lb), HSP70 
homologue (ORF 4) , HSP90 homologue (ORF 5) and coat protein 
(ORF 6) were identified. 

ORF la was an incomplete ORF from which the 5' terminal 
portion has yet to be cloned and sequenced. The sequenced 
region presented in Figure 10 and Table 4 represents 
approximately two- thirds of the expected ORF la , as compared 
to the ORF la from BYV, CTV, and LIYV. The partial ORF la was 
terminated by the UGA stop codon at positions 4165-4167; the 
respective product consisted of 13 88 amino acid residues and 
had a deduced Mr of 148,603. Database searching indicated 
that the C- terminal portion of this protein shared significant 
similarity with the Superfamily 1 helicase of positive-strand 
RNA viruses. Comparison of the conserved domain region (291 
amino acids) showed a 38.4% identity with an additional 19.7% 
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similarity between GLRaV-3 and BYV and a 32.4% identity with 
an additional 21.1% similarity between GLRaV-3 and LIYV (Table 
6) . Six helicase conserved motifs of Superfamily 1 helicase 
of positive -strand RNA viruses (Hodgman, Nature . 333:22-23 
(Erratum 578) (1988); Koonin et al., Crit. P«»v. Binrh Pfn 
Molec , Biol , , 28:375-430 (1993)) were also retained in GLRaV- 
3. Analysis of the predicted phylogenetic relationship in 
helicase domains between GLRaV-3 and the other positive-strand 
RNA viruses placed GLRaV-3 along with the other 
closteroviruses, including BYV, CTV, and LIYV, into the 
"tobamo" branch of the alphavirus-like supergroup (Figure 11, 
Table 5) . Table 5 compares the amino acid sequence alignment 
of the helicase of GLRaV-3 with respect to BYV, .CTV, and LIYV. 
Consensus amino acid residues are shown. Uppercase letters 
indicate identical amino acids, lowercase letters indicate at 
least three identical or functionally similar amino acids. 
Six conserved motifs (I to VI) that are conserved among the 
Superfamily 1 helicase (Koonin et al., Crit. Ppv. in nio r h fni 
MQle<7, fliol , , 28:375-430 (1993)) of the positive-strand RNA 
viruses are overlined. 

ORF lb overlapped the last 113 nucleotides of ORF la and 
terminated at the UAG codon at positions 5780 to 5782. This 
ORF encodes a protein of 536 amino acid residues with a 
calculated Mr of 61,050 (Figures 10, Table 4) . Database 
screening of this protein revealed significant similarity to 
the Supergroup 3 RdRp of the positive-strand RNA viruses. 
Sequence comparison of GLRaV-3 with BYV, LIYV, and CTV over a 
313 -amino acid sequence fragment revealed a striking amino 
acid sequence similarity among eight conserved motifs (Table 
8) * Consensus amino acid residues are shown. Uppercase 
letters indicate identical amino acids, and lowercase letters 
indicate at least three identical or functionally similar 
amino acids. The motifs (I to VIII) that are conserved among 
the Supergroup 3 RNA polymerase of positive-strand RNA viruses 
are overlined. The best alignment was with BYV, while the 
least alignment was with LIYV (Table 6) . Analysis of 
predicted phylogenetic relationships of the RdRp domains of 
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the alphavirus-like supergroup viruses again placed GLRaV-3 
into the tobamo branch along with other closteroviruses, BYV, 
CTV # BYSV, and LIYV (Figure 12) . 

Publications on BYV, CTV, and LIYV have proposed that ORF 
lb is expressed via a +1 ribosomal frameshift (Agranovsky 
(1994); Dolja et al.. Arm. Rev. Phytopathol . , 32:261-285 
(1994); Karasev (1995), and Klaassen (1995)). Direct 
nucleotide sequence comparison was performed within the 
ORFla/lb overlap of GLRaV-3 with respect to BYV, CTV, or LIYV. 
An apparently significant similarity was observed only to LIYV 
(Table 9), and not to BYV or CTV. The so-called "slippery" 
GGGUUU sequence and the stetn-and-loop structure that were 
proposed to be involved in the BYV frameshift was absent from 
the GLRaV-3 ORFla/lb overlap. The predicted frameshift within 
the GLRaV-3 ORF la/lb overlap was selected based on an 
inspection of the C-terminal portion of the helicase alignment 
and the N-terminal portion of the RdRp alignment between 
GLRaV-3 and LIYV. 

Table 9 compares the aligned GLRaV-3 and LIYV nucleotide 
sequences (presented as DNA) in the vicinity of the proposed 
frameshift, nt 4099-4165 in GLRaV-3 and nt 5649-5715 in LIYV. 
Identical nucleotides are uppercase. LIYV predicted +1 
frameshift region (aAAG) and the corresponding GLRaV-3 (cACA) 
are bold and italic. The encoded C-terminus of HEL and N- 
terminus of RdRp are presented above (GLRaV-3) and below 
(LIYV) the nucleotide alignment. Repeat sequences are 
underlined. 

The GLRaV-3 ORF la/lb frameshift was predicted to occur in 
the homologous region of the LIYV genome, and was also 
preceded by a repeat sequence (GCTT) (Figure 24) . Unlike 
LIYV, this repeat sequence was not a tandem repeat and was 
separated by one nucleotide (T) in GLRaV-3 . 

The frameshift was predicted to occur at CACA (from His 
to Thr) in GLRaV-3 rather than slippery sequence AAAG in LIYV. 
However, additional experiments on in vitro expression of 
GLRaV-3 genomic RNA are needed in order to determine whether 
or not a large fusion protein is actually produced. 
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ORF 2 is predicted to encode a small peptide of 51 amino 
acids and a calculated Mr of 5,927. Database searching did 
not reveal any obvious protein matches within the existing 
Genbank (Release 84.0). 

Intergenic regions of 220 bp between ORF lb and ORF 2 and 
1065 bp between ORF 2 and ORF 3 were identified. There is no 
counterpart in the BYV or LIYV genomes; instead, an ORF of 3 3K 
in CTV (Karasev et al., J. Gen. Virol. , 75:1415-1422 (1994)) 
or 32K in LIYV (Klaassen (1995)) is observed over this similar 
region . 

ORF 3 encodes a small peptide of 45 amino acids and a 
calculated Mr of 5,090 (p5K) . Database searching revealed 
that it was most closely related to the small hydrophobic, 
transmembrane proteins of BYV (6.4K), CTV (6K) , and LIYV (5K) . 
Individual comparison (Table 3) showed that LIYV was its 
closest relative (45.8%) at the nucleotide level and BYV was 
the most homologous (30.4%) at the amino acid level. 

Table 10 compares the aligned amino acid sequences the 
small hydrophobic transmembrane protein of GLRaV-3 p5K with 
those of BYV (pGK) , CTV (p6K) , and LIYV (p5K) . Consensus 
amino acid residues are shown. Lowercase letters indicate at 
least three identical or functionally similar amino acids. 
The transmembrane domain that has been identified in several 
other closteroviruses, BYV f CTV, and LIYV (Karasev et al., 
Virology . 208:511-520 (1995)), isoverlined. 

ORF 4 encodes a protein of 549 amino acids and a 
calculated Mr of 59,113 (p59) (Figure 10, Table 4). Database 
screening revealed significant similarity to the HSP70 family, 
the p65 protein of BYV, the p65 protein of CTV, and the p62 
protein of LIYV. A multiple amino acid sequence alignment of 
GLRaV-3 p59 with HSP70 analogs of other closteroviruses showed 
striking sequence similarity among eight conserved motifs (A- 
H) . Functionally important motifs (A-C) that are 
characteristic of all proteins containing the ATPase domain of 
the HSP70 type (Bork et al., prnr. Natl. Arad . Sri. USA, 
89:7290-7294 (1992)) were also preserved in GLRaV-3 p59, which 
suggested that this HSP70 chaperonin-like protein may also 
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possess ATPase activity on its N- terminal domain and protein- 
protein interaction on its C-terminal domain (Dolja (1994) . 

Table 11 presents the amino acid sequence alignment of 
the HSP70-related protein of GLRaV-3 (p59K) with those of BYV 
5 ( P 65K) . CTV (pS5K) , and LIYV (p62K) . The eight conserved 
motifs (A to H) of cellular HSP70 are overlined. Consensus 
amino acid residues are shown. Uppercase letters indicate 
identical amino acids, and lowercase letters indicate at least 
three identical or functionally similar amino acids. 

10 Analysis of the predicted phylogenetic relationship of 

pS9 of GLRaV-3 with HSP70 -related proteins of other 
enteroviruses (BYV, CTV, and BYSV) and cellular HSP70s again 
placed the four closteroviruses together and the rest of the 
cellular HSP70S on the other branches (Figure 13). Although 

15 several closterovirus HSP70 -related proteins are closely 

related to each other and distant from other cellulax members 
of this family, inspection of the phylogenetic tree indicates 
that GLRaV-3 may be an ancestral closterovirus relatively 
early in evolution as predicted by Dolja (1994) , because 

20 GLRaV-3 was placed in between closteroviruses and the other 
cellular HSP70 members. 

ORF 5 encodes a protein of 483 amino acids with a 
calculated Mr of 54,852 (pSS) (Figure 10, Table 4). Table 12 
compares the alignment of the amino acid sequence deduced from 

25 the PCR fragment of GLRaV-3 with respective regions of HSP90 
homologues of beet yellow virus (BYV) (p64) , citrus tristeza 
virus (CTV) (p61) , and lettuce infectious yellow virus (LrYV) 
(p59) . Consensus amino acid residues are shown. Uppercase 
letters indicate identical amino acids, lowercase letters 

30 indicate at least three identical or functionally similar 
amino acids. 

No significant sequence homology with other proteins was 
observed in the current database (GenBank, release 84.0). 
Direct comparison with other counterparts (p61 of CTV, P 64 of 
35 BYV, and p59 of LIYV) of closteroviruses revealed some degree 
of amino acid sequence similarity, with 21.7% to BYV, 17.5% to 
CTV, and 16.7% to LIYV, respectively (Tables 6, 11, 12). Two 
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conserved regions of HSP90 previously described in BYV and CTV 
(Pappu (1994)) were identified in the p55 of GLRaV-3 (Table 
13) . 

ORF 6 encodes a protein of 313 amino acids with a 
calculated Mr of 34 f 866 (p35) (Figure 10 and Table 4) . The 
fact that this ORF was encoded by three overlapping GLRaV-3 
immunpositive clones indicates that it contains the coat 
protein gene of GLRaV-3. Alignment of the product of ORF 6 
(p35) with the coat protein sequences of BYV, CTV, and LIYV, 
is presented in Table 3 . The typical consensus amino acid 
residues (S, R, and D) of the coat protein of the filamentous 
plant viruses (Dolja (1991)), which may be involved in salt 
bridge formation and the proper folding of the most conserved 
core region (Boyko (1992)), were also retained in the p35 
(Table 3) . Individual sequence comparison showed the highest 
similarity to CTV (20,5%) and BYV (20.3%), and the lowest 
similarity to LIYV (17.8%). Analysis of predicted 
phylogenetic relationships with other filamentous plaint 
viruses tentatively placed GLRaV-3 into a separate, but a 
closely related branch of closteroviruses (Figure 5) . 

ORF 7 encodes a protein of 477 amino acids with a 
calculated Mr of 53,104 (p53) (Figure 10 and Table 4). Based 
on the presence of conserved seuqences, this protein is 
designated as grapevine leafroll virus coat protein repeat 
(p53) . 

ORF 8 encodes an undefined polypeptide of a calculated Mr 
of 21,248 (p21) . 

ORF 9 encodes an undefined protein of calculated Mr of 
19,588 (p20) . 

ORF 10 encodes an undefined polypeptide with a calculated 
Mr of 19,653 (p20) . 

ORF 11 encodes an undefined protein of calculated Mr of 
6963 (p7). 

In the present study, many GLRaV-3 dsRNA specific cDNA 
clones were identified using a probe generated from UNI -AMP™ 
PCR-amplified cDNA. Using UNI -AMP™ adapters and primers 
(Clontech) in PCR has several advantages. First, it is not 
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necessary to know the nucleotide sequence of an amplified 
fragment. Second, cDNA can be amplified in sufficient amounts 
for specific probe preparation. In general, cDNA amplified by 
PCR using UNI -AMP™ primers and adapters could be used for 
5 cloning as well as a probe for screening of cDNA libraries. 
However, low abundance of the starting material and many 
cycles of PCR amplification often incorporate errors into the 
nucleotide sequence (Keohavong et al., Proc. Natl. Acad. Sri 
TT5A . 86:9253-9257 (1989); Saiki et al . , Science, 239:487-491 
10 (1988)). In the present study, only UNI -AMP™ PCR amplified 

cDNA was used as a probe for screening. The cDNA library was 
generated by direct cloning of the cDNA that was synthesized 
by AMV reverse transcriptase. Therefore, the cDNA cloned 
inserts are believed to more accurately reflect the actual 
15 sequence of the dsRNA and the genomic RNA of GLRaV-3. 

A total of 15,227 nucleotides or about 80V of the 
estimated 16 kb GLRaV-3 dsRNA was cloned and sequenced. 
Identification of this sequence fragment as the GLRaV-3 genome 
was based on its sequence alignment with the coat protein gene 
20 of GLRaV-3 . This is the first direct evidence showing that 
high molecular weight dsRNA (-16 kb) isolated from GLRaV-3 
infected vines is derived from GLRaV-3 genomic RNA. Based 
upon the. nine ORFs identified, the genome organization of 
GLRaV-3 bears significant similarity to the other 
25 enteroviruses sequenced (BYV, CTV, and LIYV) (Figure 10) . 

Dolja (1994) tentatively divided the closterovirus genome 
into four modules. For GLRaV-3, the 5' accessory module 
including protease and vector transmission factor is yet to be 
identified. The core module, including key domains in RNA 
30 replication machinery (MET-HEL-RdRp) that is conserved 

throughout the alphavirus supergroup, has been revealed in 
parts of the HEL and RdRp domains. The MET domain has not yet 
been identified for GLRaV-3 . The chaperon module, including 
three ORFs coding for the small transmembrane protein, the 
35 HSP70 homologue, and the distantly related HSP90 homologue, 
has been fully sequenced. The last module includes coat 
protein and its possible diverged copy and is also preserved 
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in GLRaV-3. Overall similarity of the genome organization of 
GLRaV-3 with other closteroviruses further support the 
inclusion of GLRaV-3 as a member of closteroviruses <Hu (1990) 
and Martelli (1991) , which are hereby incorporated by 
reference) . However, observation of a predicted ambisense 
gene on its 3* terminal region may separate GLRaV-3 from other 
closteroviruses. Further comparative sequence analysis (Table 
3) as well as phylogenetic observation of GLRaV-3 with respect 
to other closteroviruses over the entire genome sequence 
region suggested that GLRaV-3 is most closely related to BYV, 
followed by CTV, and LIYV. 

As suggested by others (Agranovsky (1994), Dolja (1994) , 
Karasev (1995), and Klaassen (1995)), expression of ORF lb in 
closteroviruses may be via a +1 ribosomal frameshift 
mechanism. In GLRaV-3, a potential translation frameshift of 
ORF lb could make a fusion HEL-RdRp protein of over 1,926 
amino acid residues with a capacity to encode a protein of 
more than 210K Comparative study of GLRaV-3 with respect to 
other closteroviruses over the ORF la/lb overlap revealed a 
significant sequence similarity to LIYV, but not to BYV or to 
CTV. The so-called slippery sequence (GGGUUU) and stem- loop 
and pseudoknot structures identified in BYV (Agranovsky 
(1994) , which is hereby incorporated by reference) is not 
present in GLRaV-3. Thus, a frameshift mechanism that is 
similar to LIYV may be employed for GLRaV-3 . However, protein 
analysis is necessary in order to determine the protein 
encoding capacities of these ORFs. 

Differing from BYV, both CTV and LIYV have an extra ORF 
(ORF 2) in between RdRp (ORF lb) and the small membrane 
protein (ORF 3) and potentially encoding a protein of 33K or 
32K, respectively. However, in GLRaV-3, there is a much 
smaller ORF 2 (7K) followed by a long intergenic region of 
1065 bp. 

So far, among all plant viruses described, the HSP70 
related gene is present only in the closteroviruses (Dolja 
(1994)). Identification of the GLRaV-3 HSP70 gene was based 
on an assumption that this gene should also be present in the 
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closterovirus associated with grapevine leaf roll disease, 
specifically GLRaV-3. Thus, cDNA clones that reacted with 
HSP70 -degenerated primers were identified for sequence 
analysis. The identification of subsequent clones for 
sequencing was based on the gene -walking methodology. 
However, identification of immunopositive clones enabled 
identification of the coat protein gene of GLRaV-3 and proved 
that the HSP70-containing sequence fragment is present in the 
GLRaV-3 RNA genome-. 

The 16 kb dsRNA used for cDNA synthesis was assumed to be 
a virus replicative form (Hu (1990) . Selected clones have 
been shown by Northern hybridization to hybridize to the 16 kb 
dsRNA and several smaller RNAs (presumably subgenomic RNAs) 
(Figure 1) . Second, three GLRaV-3 antibody- reacting clones 
were identified after immuno- screening of the protein 
expression library with both GLRaV-3 polyclonal (Zee (1987) ) 
and monoclonal (Hu (1990)) antibodies. After nucleotide 
sequencing, these three antibody- reacting clones were shown to 
overlap one another and contain a common ORF which potentially 
encodes a protein with calculated Mr of 35K. This is 
consistent with the Mr estimated on SDS-PAGE (41K) . Third, 
analysis of the partial genome sequence of GLRaV-3 suggested a 
close similarity in genome organization and gene sequences to 
the other closteroviruses (Dolja (1994)). 

Information regarding the genome of GLRaV-3 provides a 
better understanding of this and related viruses and adds to 
the fundamental knowledge of closteroviruses. Present work on 
the nucleotide sequence and genome organization (about 80V of 
the estimated genome sequence) has provided direct evidence 
for a close relationship between GLRaV-3 and other 
closteroviruses. It has also enabled, for the first time, the 
evaluation of phylogenetic relationships of GLRaV-3 based on a 
wide range of genes and gene products (helicase, polymerase, 
HSP70 homologue, HSP90 homologue, and coat protein) . Based 
upon major differences in genome format and organization 
between BYV, CTV, and LIYV, along with phylogenetic analysis, 
Dolja (1994)) proposed the establishment of the new family 
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Closteroviridae with three new genera of Closterovirus (BYV) , 
Citrivirus (CTV) , and Biclovirus (LIYV) . This work on genome 
organization and phylogenetic analysis, along with evidence 
that this virus is transmitted by mealybugs (see hereinabove) 
5 indicates that a new genus under Closteroviridae family should 
be established. Thus, GLRaV-3 (the NY1 isolate) is proposed 
as the type representative of the new genus, Graclovi2rus (gra- 
pevine cl o-sterovirus) . Further sequencing of other grapevine 
leafroll associated closteroviruses may add more members to 
10 this genus . 

Another cDNA library of GLRaV-3 has been established 
recently from dsRNA of an Italian isolate of GLRaV-3 
(Saldarelli et al., Plant- Pathology (Oxford), 43:91-96 (1994), 
which is hereby incorporated by rsference) . Selected clones 
15 react specifically to GLRaV-3 dsRNA on a Northern blot; 

however, no direct evidence was provided to suggest that those 
clones were indeed from GLRaV-3 genomic RNA. Meanwhile, a 
small piece of sequence information from one of those cDNA 
clones was used to synthesize primers for the development of a 
20 PCR detection method (Minafra (1994) , which is hereby 

incorporated by reference) . Direct sequence comparison of 
these primer sequences to GLRaV-3 genome sequence obtained in 
the present study, showed that one of the primers (H229, 
5 1 ATAAGCATTCGGGATGGACC (SEQ ID NO: 27) ) is located at 
25 nucleotides 5562-5581 and the other (C547, 

5 1 ATTAACtTgACGGATGGCACGC (SEQ ID NO:28)) is in reverse 
direction and is the complement of nucleotides 5880-5901. 
Mismatching nucleotides between the primers and GLRaV-3 
sequence are shown in lowercase letters. Sequence comparison 
30 over these short primer regions to GLRaV-3 (isolate NY1) 

genome sequence showed a 90-95% identity, which suggested that 
these two isolates belong to the same virus (GLRaV-3) . 
Moreover, the primers prepared by Minafra (1994), which is 
hereby incorporated by reference, from the Italian isolate of 
35 GLRaV-3 produced an expected size of PCR product with 
templates prepared from the NY1 isolate of GLRaV-3. 
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The remainder of the GLRaV-3 genome can be readily 
sequenced using the methods described herein and/or techniques 
well known to the art. 

Rvamplft 1 * - Td ft nri f i raf i nn anr3 Character i zat ion of fhp n K 
ORE 

The complete nucleotide sequence of the GLRaV-3 HSP90- 
related gene is given in Table 4 . Initial sequencing work 
indicated that an open reading frame (ORF) encoding for a 
protein with a calculated Mr of 43K (Table 14) was downstream 
of the HSP70-related gene. This gene was selected for 
engineering because the size of its encoded product is similar 
to the GLRaV-3 coat protein gene. However, after sequence 
analysis, this incomplete ORF was located in the 3* terminal 
region of the HSP90-related gene. It is referred to herein as 
the incomplete GLRaV-3 HSP90 gene or as the 43K ORF. 

Example 15 - Qist nm-PCR EnmnPPnna the Incomplete GLPaV-3 
fffipgn r,p P p fnr Tftcprftflfiion in Plant Ti ssues 

Two custom synthesized oligonucleotide primers, 5' primer 
(93-224, tacttatctagaac cATGGAAGCGAGTCGACGACTA (SEQ ID NO:29)) 
and 3' complementary primer (93-225, 

t c 1 1 gagga t c c a t ggAGAAACATCG T CGCATACTA (SEQ ID NO: 30)) that 
flank the 43K ORF were designed to amplify the incomplete 
HSP90 gene fragment by polymerase chain reaction. Addition of 
a restriction enzyme Ncol site in the primer facilitates 
cloning and protein expression (Table 15) (Slightom, Gen^ r 
100:251-255 (1991)). Using these primers, a product of the 
proper size (1.2 kb) was amplified by RT-PCR using GLRaV-3 
double-stranded RNA (dsRNA) as template. 

Table 14 shows the nucleotide sequence fragment 
containing the 43 kDa open reading frame that used to engineer 
the plant expression cassette, pBI525GLRaV-3hsp90 . This 
fragment (nucleotides 9404 to 10,503 of the partial GLRaV-3 
genome sequence, Table 4) is located in the 3' portion of 
GLRaV-3 HSP90-related gene. Nucleotides shown in lower case 
facilitate cloning by adding Ncol restriction sites. 
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The PCR amplified product was treated with Ncol, isolated 
from a low-melting temperature agarose gel, and cloned into 
the same restriction enzyme treated binary vector pBI525 
(obtained from William Crosby, Plant Biotechnology Institute, 
Saskatoon, Sask., Canada), resulting in a clone pBI525GLRaV- 
3hsp90 (Figure 31) . A plant expression cassette, the EcoRI 
and Hindlll fragment of clone pBI525GLRaV-3hsp90 , which 
contains properly engineered CaMV 35S promoters and a Nos 3' 
untranslated region, was excised and cloned into a similar 
restriction enzyme digested plant transformation vector, 
pBinl9 (Figure 14) (Clontech Laboratories, Inc.). Two clones, 
pBinl9GLRaV-3hsp90-12-3 and pBinl9GLRaV-3hsp90-12-4 that were 
shown by PCR to contain the proper size of the incomplete 
HSP90 gene were used to transform the avirulent A. 
tumefaciens , strain LBA4404 via electroporation (Bio-Rad) . 
The potentially transformed Agrobac ter ium was plated on 
selective media with 75 /xg/ml of kanamycin. Agrobac ter ium 
lines which contain the HSP90 gene sequence were used to 
transform tobacco (Nicotiana tobacum cv. Havana 423) using 
standard procedures (Horsch et al., Scipncp r 227:1229-1231 
(1985)). Kanamycin resistant tobacco plants were analyzed by 
PCR for the presence of the transgene. Transgenic tobacco 
plants with the transgene were self pollinated and seed was 
harvested. 

Example 16 - Cus tom-PCR Enainfi^rina of the 43K ORF 

The complete sequence of the GLRaV-3 hsp90 gene was 
reported in Table 4. However, in the present study, using two 
custom synthesized oligo primers (93-224, 

tacttatctagaac cATGG AAGCG AGT CG ACGACTA (SEQ ID NO: 29) and 93- 
225, t c 1 1 g agga t c ca t ggAGAAACATCGTCGCATACTA (SEQ ID NO: 30)) and 
GLRaV-3 dsRNA as template, the incomplete HSP90 related gene 
sequence was amplified by RT-PCR which added an Ncol 
restriction enzyme recognition sequence (CCATGG) around the 
potential translation initiation codon (ATG) and another Ncol 
site, 29 nt downstream from the translation termination codon 
(TAA) (Table 14) . The PCR amplified fragment was digested 
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with Ncol, and cloned into the same restriction enzyme treated 
plant expression vector, pBI525. Under ampicillin selective 
conditions, hundreds of antibiotic resistant transf ormants of 
E. coli strain DH5a were generated. Clones derived from five 
colonies were selected for further analysis. Restriction 
enzyme mapping (Ncol or BamHI and EcoRV) showed that three out 
of five clones contained the proper size of the incomplete 
GLRaV-3 HSP90 sequence. Among them, two clones were 
engineered in the correct 5 1 -3 1 orientation with respect to 
the CaMV-AMV gene regulatory elements in the plant expression 
vector, pBI525. A graphical structure in the region of the 
plant expression cassette of clone pBI525GLRaV-3hsp90-12 is 
presented in Figure 14 . 

The GLRaV-3 HSP90 expression cassette was removed from 
clone pBI525GLRaV-3hsp90-12 by a complete digestion with 
Hindlll and EcoRI and cloned into the similar restriction 
enzyme treated plant transformation vector pBinl9. A clone 
designated as pBinl9GLRaV-3hsp90-12 was then obtained (Figure 
14) and was subsequently mobilized into the avirulent 
Agrobacterium strain LBA4404 using a standard electroporation 
protocol (Bio-Rad) . Potentially transformed Agrobacterium 
cells were then plated on a selective medium (75 ^g/ml 
kanamycin) , and antibiotic resistant colonies were analyzed 
further by PCR with specific synthesized primers (93-224 and 
93-225) to see whether or not the incomplete HSP90 gene was 
still present. After analysis, clone LBA4404/pBinl9GLRaV- 
3hsp90-12 was selected and used to transform tobacco tissues. 

Example 17 - Transformation and Charaeterirat j on of Transopm'r 

Plants 

The genetically engineered A. tumefaciens strain, 
LBA4404/pBinl9GLRaV-3hsp90-12, was co-cultivated with tobacco 
leaf discs as described (Horsch (1985)), Potentially 
transformed tobacco tissues were selected on MS regeneration 
medium (Murashige et al., pHy.gi oloq-i a PI ant- a rum r 15:473-497 
(1962)) containing 300 fig/ml of kanamycin. Numerous shoots 
developed from kanamycin resistant calli in about 6 weeks. 
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Rooted tobacco plants were obtained following growth of 
developed shoots on a rooting medium (MS without hormones) 
containing 3 00 /ig/ml of kanamycin. Eighteen independent, 
regenerated kanamycin resistant plants were transplanted in a 
greenhouse and tested for the presence of the HSP90-related 
gene by PCR. Fourteen out of eighteen selected kanamycin 
resistant putative transgenic lines were shown to contain a 
PCR product with the expected size of 1.2 kb. 

The genetically engineered Agrobacterium tumefaciens 
strain LBA4404/pBinl9GLRaV-3hsp90-12 was also used to 
transfrom the grapevine roots tock C. 3303 (Vitis riparia x 
Vitis rupestris) . Embryogenic calli of C. 3309 were obtained 
by culturing anthers on MSE medium (Murashige and Skoog salts 
plus 0.2% sucrose, 1.1 mg/L 2,-4-D, and 0.2 mg/L BA. The 
medium was adjust to pH 6.5 and 0.8V Noble agar was added. 
After autoclaving 100 ml M-0654, 100 ml M-0529 and 1 ml 
vitamin M-3 900 were added to the medium. Ater 60 days primary 
calli were indcued and transferred to hormone-f ree HMG medium 
(1/2 Murashige salts with 10 g/L sucrose, 4.6 g/L glycerol and 
0.8V Noble agar) for embryogenesis . Calli with globular or 
heart-shaped embryos were immersed for 15 min. in A. 
tumefaciens LBA4404/pBinl9GLRaV-3hsp90-12 suspended in MS 
liquid medium. The embryos were blotted on filter paper to 
remove excess liquid and transferred to HMG medium with 
acetosyxingone (100 ^M) and kept for 48 hrs. in the dark at 
28 °C. The calli were then washed 2-3 times in MS liquid 
medium plus cefotaxime (300 /ig/ml) and carbenicillin (200 
MSr/ml) and transferred to HMG medium with the same antibiotics 
for 1-2 weeks. Subsequently, the embryogenic calli were 
transferred to HMG medium containing 20 or 20 /ig/ml kanmamycin 
and 300 /xg/ml cefotaxime plus 200 /ig/ml carbenicillin to 
select for transgenic embryos. After being on selective 
medium for 3-4 months, growing embryos were transferred to 
HMG, MGC (full-strength MS salts amended with 20 g/L sucrose, 
4.6 g/L glycerol, 1 g/L casein hydrolysate and 0.8% Noble 
agar) , or MSE medium with kanamycin. After 4 months 
germinated embryos were transferred to baby food jars 
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containing rooting medium (Woody plant medium described by 
Lloyd and McCown (1981) "Commercially feasible 
micropropagation of mountain laurel, Kalmia latifola, by use 
of shoot tip culture," Proc. Intl. Plant Prop. Soc. 211:421-427, 
supplemented with 0.1 mg/L BA, 3 g/L activated charcoal and 
1.5% sucrose. The pH was adjusted to 5.8 and Noble agar was 
added to 0.7%) . Plantlets with roots were transplatned to 
pots with artificial soil mix and grown in greenhouses. 88 . 
grapevine plants have cultivated, and several have been shown 
to contain the 43 kDa protein gene (by PCR) . 

Using the methods described above, engineering of the 
incomplete HSP90 gene of GLRaV-3 into plant expression and 
transformation vectors has been effected. The targeted gene 
sequence was shown to be integrated into the plant genome by 
PCR analysis of the putative transgenic tobacco plants. The 
engineered A. tumefaciens strain LBA4404/pBinl9GLRaV-3hsp90-l2 
has been used to transform grapes and tobacco. Furthermore, 
this plant transformation vector can serve as a model for 
construction of other GLRaV-3 vectors, such as coat protein, 
RdRp, and HSP70, for which nucleotide sequences are disclosed 
herein . 

Since the first demonstration of transgenic tobacco 
plants expressing the coat protein gene of TMV resulted in 
resistance against TMV infection (Powell-Abel et al., Science, 
232:738-743 (1986), which is hereby incorporated by 
reference) , the phenomenon of the coat protein-mediated 
protection has been observed for over 20 viruses in at least 
10 different taxonomic groups in a wide variety of 
dicotyledonous plant species (Beachy et al., Armn . Ppv. 
Phytopathol. . 28:451-74 (1990); Wilson, PrQC. Natl, ftcad ■ 
flci . . TTSA . 90:3134-3141 (1993)). If gene silencing (or co- 
suppression) (Finnegan et al., Rio /Technology* 12:883-888 

(1994); Flavell, Pror . Naf_1 *r?ari SH . USA. 91:3490-3496 
(1994)) is one of the resistance mechanisms (Lindbo et al., 
THr Plant Oil , 5:1749-1759 (1993); Pang et al., 
Bin/Tftrhnoloov . 11:819-824 (1993); Smith et al., Thf Pi ant 
Cell , 6:1441-1453 (1994)), then one would expect to generate 
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transgenic plants expressing any part of a viral genome 
sequence to protect plants from that virus infection- Thus, 
in the present study, transgenic plants expressing the 43K ORF 
{or the incomplete hsp90 gene) are protected from GLRaV-3 
infection. 

Since tobacco {Nicotiana tobaccum cv. Havana 423) is not 
the host of GLRaV-3, direct evaluation of virus resistance was 
not possible in tobacco. However, after mechanical 
inoculation of N. benthamiana with grapevine leaf roll infected 
tissue, Boscia (1995) have recovered a long closterovirus from 
N. benthamiana which is probably GLRaV-2. Thus, it is 
believed that other types of grapevine leafroll associated 
closteroviruses can also be mechanically transmitted to N. 
benthamiana. With transfer of an engineered 43 K ORF from 
GLRaV-3 to N. benthamiana and to the grapevine rootstock 
Couderc 3309, it is possible to evaluate the resistance of 
those plants against GLRaV-2 infection. 

Example lfl - Coat Protein-mediated Protection a nd Ofh^T- F^rm? 

of PathQgflr.-dsriyed Resistance 

The successful engineering technique used in the above 
work can be utilized to engineer other gene sequences of 
GLRaV-3 which have since been identified. Among these, the 
coat protein gene of GLRaV-3 is the primary candidate since 
coat protein-mediated protection (Beachy (1990); Hull et al., 
Crit. Pp v. Plant Sci. . 11:17-33 (1992); Wilson (1993)) has 
been the most successful example in the application of the 
concept of pathogen-derived resistance (Sanford et al., J. 
Theor. Bi'nl . . 113:395-405 (1985)). Construction of plant 
expression vector (pEPT8/cpGLRaV-3) and Agrobacterium binary 
vector (pGA482pEPT8/cpGLRaV-3) was done following a strategy 
similar to the above. The GLRaV-3 coat protein gene was PCR 
amplified with primers (KSL95-5, 

actatttctagaac c ATGG CATTTG AACTG AAATT (SEQ ID NO: 31) , and KSL95- 
€, 1 1 c t gagga t c c a t ggTATAAG CT CC CATGAATTAT (SEQ IDNO.-32)) and 
cloned into pEPT8 after Ncol treatment. The expression 
cassette from pEPT8/cpGLRaV-3 (including double CaMV 35S 
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enhancers, 3SS promoter, alfalfa mosaic virus leader sequence, 
GLRaV-3 coat protein gene, and 35S terminator) was digested 
with HindHI and cloned into pGA4 82G (Figure IS) . The 
resulting Agrobac terivm binary vector ( P GA482GpEPT8/cpGLRaV-3) 
5 was mobilized into A. tvmefaciens strain C58Z707 and used for 
transformation of grapevines. 

Other gene sequences (e.g., ORF lb, the RNA dependent RNA 
polymerase) can also be used, as replicase-tnediated protection 
has been effectively used to protect plants from virus 

10 infection (Carretal., firnrinarn in Virology , 4:339-347 
(1993); Golemboski et al . , Pm^ Nf>H . Anati ■ S<zi ■ USA , 
87:6311-6315 (1990)) . The HSP70 homologue can also be used to 
generate transgenic plants that are resistant to all types of 
grapevine leafroll associated enteroviruses because 

IS significant sequence similarity is observed over HSP70 

conserved domains. Moreover, the phenomenon of RNA-mediated 
protection has also been observed (de Haan et al., 
n^/T^nlnov. 10:1133-1137 (1992); Farinelli et al., Mol^ 
nnnr M y r ~K- Tn^r... 6:284-292 (1993); Kawchuk et al . , 

20 H n M -™ h " Interact. 4:247-253 (1991); Lindbo et al., 

vS mingy . 189:725-733 (1992); Lindbo et al., Mol ■ Plant 

T^Pr a rt . S:144-1S3 (1992); Lindbo et al., flrmlrnrw in 
^mloav . 4:369-379 (1993); Pang (1993); Van Der Wilk et al., 
? i?r,r Mnl ttiol. . 17:431-440 (1991)). Thus, untranslatable 

25 transcript versions of the above mentioned GLRaV- 3 genes also 
produce leafroll resistant transgenic plants. 

Another form of pathogen-derived resistance effective in 
the control of plant viral disease is the use of antisense 
RNA. Transgenic tobacco plants expressing the antisense 

30 sequence of the coat protein gene of cucumber mosaic virus 
(GMV) showed a delay in symptom expression by CMV infection 
(Cuozzo et al., pWiwrhnnloov. 6:549-554 (1988)). Transgenic 
plants expressing either potato virus X (PVX) coat protein or 
its antisense transcript were protected from infection by PVX. 

35 However, plants expressing antisense RNA were protected only 
at low inoculum concentration. The extent of this protection 
mediated by antisense transcript is usually lower than 
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transgenic plants expressing the coat protein (Hemenway et 
al., rvmn J . . 7:1273-1280 (1988)). This type of resistance 
has also been observed in bean yellow mosaic virus (Hammond et 
al., Phyt-npat-holoav . 81:1174 (1991), tobacco etch virus 
5 (Lindbo et al . , iLLLOlosy., 189:725-733 (1992)) potato, virus Y 
(Farinelli (1993)), and zucchini yellow mosaic virus (Fang et 
al., MoT PlarH- M-i rr-nhg Tnf Praet . . 6:3S8-367 (1993)). 
However, high level of resistance mediated by antisense 
sequence was observed to be similar to potato plants (Russet 

10 Burbank) expressing potato leafroll virus coat protein 

(Kawchuk (1991) ) . Besides using antisense transcript of the 
virus coat protein gene, other virus genome sequences have 
also been demonstrated to be effective. These included the 
51-nucleo"tide sequences near the 5' end of TMV RNA (Nelson et 

15 al., fipnp (Abst), 127:227-232 (1993)) and noncoding region of 
turnip yellow mosaic virus genome (Zaccomer et al., G e ne , 87- 
94 (1993) ) . 

GLRaV-3 has been shown to be transmitted by mealybugs and 
in some cases it has been shown to spread rapidly in vineyards 

20 (see hereinabove) . This disease will become more of a problem 
if mealybugs become resistant to insecticides or if 
insecticide use is restricted. Thus, the development of 
leafroll resistant grapevines is environmentally sound and 
good for the economics of grape growing. 

25 Although grapevine is a natural host of Agrobacterlum (A. 

vitis is the causal agent of the grapevine crown gall 
disease) , transformation of grapevine has proven difficult 
(Baribault et al., ,t Rvt- . Bot.. 41:1045-1049 (1990); 
Baribault et al., Plnnf fr 11 *»r™t-fl. 8:137-140 (1989); Colby 

30 et al., ,T »" ^ Wort . fid.. 116:356-361 (1991); Guellec et 
al., pi.t-i- r»11 Tj nr^n ft.lt.. 20:211-216 (1990); Hebert 
et al., pm~- r*~n Ppnnr-t*. 12:585-589 (1993); Le Gall et 
al.,£lantJcifincft, 102:161-170 (1994); Martinelli et al., 
Th pot* Rppl fl>n^<« . 88:621-628 (1994); Mullins et al . , 

35 Pin/TPrhnolnov . 8:1041-1045 (1990)). Recently, an efficient 
regeneration system using proliferative somatic erabryogenesis 
and subsequent plant development has been developed from 
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zygotic embryos of stenospermic seedless grapes (Mozsar, J. 
et al.. vir-is . 33:245-246 (1994); Eraerschad (1995)). Using 
this regeneration system, Scorza et al., Plant Cftll Reynrtfi , 
14:589-592 (1995) succeeded in obtaining transgenic grapevines 
5 through zygotic-derived somatic embryos after particle- 

wounding/A. tumefaciens treatment. Using a Biolistic device, 
tiny embryos were shot with gold particles (1.0 /im in 
diameter) . The wounded embryos were then co-cultivated with 
A. tumefaciens containing engineered plasmids carrying the 

10 selection marker of kanamycin resistance and p -glucuronidase 
(GUS) genes. Selection of transgenic grapevines was carried 
out with 20 /ig/ml kanamycin in the initial stage and then 40 
/ig/ml for later proliferation. Small rooted seedlings were 
obtained, from embryogenic culture within 5 months of 

15 bombardment /A. tumefaciens (Scorza (1995) ) . Transgenic 

grapevines were analyzed by PCR and Southern hybridization, 
and shown to carry the transgenes. The above-mentioned 
grapevine transformation approach has been carried out in the 
current investigation to generate transgenic grapevines 

20 expressing GLRaV-3 genes. Evaluation of any potential 

leafroll resistance on transgenic grapevines is carried out 
using insect vectors or by grafting. 



25 



p^mplA ig - P r n*'"Hnn of ftnHhodteq Frrnqni TilTl? fiLRaV3 

E. coli harboring the clone pCP10-l, which contains the 
major portion of the coat protein gene of GLRaV3 (Figure 4) , 
was used to express the coat protein and the 0-galactosidase 
fusion protein. About 500 ml of LB medium containing 50 fig/ml 
of ampicillin was inoculated with a single colony and 
incubated with rigorous shaking for overnight until log-phase 
growth. Expression of the fusion protein was induced by the 
addition of 1 mM IPTG. Bacteria were harvested by 
centrifugation at 5,000 rpm for 10 min. The bacterial 
envelope was broken by sonication. After low speed 
3S centrifugation to remove cell debris, the fusion protein was 
precipitated by the addition of saturated ammonium sulfate, 
then resuspended in PBS buffer and electrophoresed in a SDS- 
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polyacrylamide gel (SDS-PAGE) . The fusion protein band was 
excised after soaking the SDS-PAGE gel in 0.25M KCl to locate 
the protein band. The protein was eluted with buffer (0-05M 
Tris-HCl, pH 7.9, 0.1% SDS, 0.1 mM EDT and 0.15 M NaCl) and 
5 precipitated by the addition of trichloroacetic acid to a 
final concentration of 20%. 

An antiserum was prepared by immunization of a rabbit 
with 0.5-1 mg of the purified protein emulsified with Freund's 
completed adjuvant followed by two more weekly injections of 

10 0.5-1 mg protein emulsified with Freund's incomplete adjuvant. 
After the last injection, antiserum was prepared from blood 
collected from the rabbit every week for a period of 4 months. 

On Western blot: analysis, the antibody gave a specific 
reaction to the 41K protein in GLRaV3 infected tissue as well 

15 as to the fusion protein itself (5 OK) and generated a pattern 
similar to the pattern seen in Figure 3. This antibody was 
also successfully used as a coating antibody and as an 
antibody-conjugate in an enzyme linked immunosorbent assay 
(ELISA) . 

20 The above method of producing antibody to GLRaV3 can also 

be applied to other GLRaV-3 gene sequences of the present 
invention. The method affords a large amount of highly 
purified protein from E. coli from which antibodies can be 
readily obtained. It is particularly useful in the common 

25 case where it is rather difficult to obtain sufficient amount 
of purified virus from GLRaV3 infected grapevine tissues. 

Although the invention has been described in detail for 
the purpose of illustration, it is understood that such detail 
is solely for that purpose, and variations can be made therein 

3 0 by those skilled in the art without departing from the spirit 
and scope of the invention which is defined by the following 
claims . 
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(i ) APPLICANT: Cornell Research Foundation, Inc. 
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<iii) NUMBER OF SEQUENCES: 32 
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(D) STATE: Colorado 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



15 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 ; 

20 GTGTCTACTT ACGCGAAGAG TGTGATGAAC GACAATTTCA ATATCCTTGA GACCCTGGTA 

ACTTTGCCCA AGTCCTTTAT AGTCAAAGTA CCTGGTTCGG TGCTGGTTAG CATAACCACT 
TCGGGCATTT CCGACAAACT TGAACTTCGG GGCGCGTTCG ACGTTTCTAA AAAGAATTTC 

25 oi-nr^riiTTfiT GGAGGATACG 240 



30 



GATGAAGGCT TGACGCCCGG AGAGAGGCTT GATGCACTAA CGCGCCGTGA ACATGCTGTG 
AATTCACCTG TAGGCCTCTT AGAACCTGGA GCTTCGGTTG CGAAGCGGGT CGTTTCCGGA 
40 ACGAAAGCTT TTCTGTCAGA ATTGTCATTG GAGGACTTCA CCACTTTCGT CATAAAAAAT 

AGGGTGCTTA TTGGTGTTTT TACTCTTTCC ATGGCTCTCA CTCCGGTGGT CTGGAAGTAC 

^ s . r xr TCTGGAT GTTTTCCACC GTGCTCGTTC GGGTACCGCG 
AGAAGGAATA TCGCGCGAAC TGoCGTGGAT bn 

45 



60 



120 



180 



TCCAGGAGGT TACGTTCGAG TCGTTTGCGC GTATTTTCTA GGGCTATTGT GGAGGATACG 
ATCAAGGTTA TGAAGGGCAT GAAATCAGAG GATGGTAAAC CACTCCCTAT AGCCGAGGAT 
TCCGTGTACG CGTTCATGAC AGGCAATATG TCAAACGTTC ATTGCACTAG GGCTGGTTTG 

nr-r-rn rrrrcCTTCT TTAGCTGTGA AGGGTGCAGC TTCACGCGCT 
CTCGGGGGCT CAAAGGCTTG CGCGGCTTCT uaulio 

^„-- n ™ TrrTTTCTTT CCGCCGGTGG TCTGTTTTAC 

ACTGGAACAA AACTCTTTTC AGGTCTCACA TCCTTTCTTT 

35 rnrr.prr,Tna ACATGCTGTG 540 



300 



360 



420 



480 



600 



660 



720 



780 
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GCCATCGGTT TACAATGTCT TAGTGGAGGA 
GCGTTAACAG TGACTCGAGG AGGGCTATCT 
5 GCTAGGCGTC AGGTACCATT GGCGTTGCTT 

TGCACTTTGT TAGGTATTTG GGCTCATGCT 
CTAGGGACGC TCTTCGGGGT GAGTGCCAGT 

10 

AACAGTCTGT TCACCGTACC GGAATTAACT 
CCCCAAGCAG CTTTAGGTAT TTCTCTCGTT 
15 CAACTAACGT ACGTACCGCC GATTGAAGGT 

TATCGCGACT TTGACTATGA CGATGGTGCA 
CCTGGAACCA ATACTTCGGA TACTTCTTCG 

20 

GGCGGTGGCT TCGACGCGCG CGTTGAGGCA 
AGGGGTAGTG TTGAGTTCGT CTACAGAGAA 
25 GCTGAAGTTG AAAAGGATCT AATAACACCA 

GTAGGTCCTG AAGCTGGGAG CGCGCCCAAC 
GAGAAATGTT CGGAGGTCAT CGTTGACGTT 

30 

CTTGAATCAA CCAATGGTGT CCAAGCTGCA 
TGTGGAGCTG GGGTAGCTAA ATCAGAAGTG 
35 GCACATGAAG CTGGTCTTGA GGCATCTAGT 

GTGCCAGTAG CCGTAGAGAA AACTGTTTTA 
GTAGATAAGG GCAAGGCGGT CGTGCACGCA 

40 

TTACCACGAG GGGCTCTAAA AATTAGTGAG 
AGAACGTGTT CCTGCGGCGT GCAGTTGGAC 
4 5 TTCTCAAATG CGTTTACCTT TGTCGATAGC 
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AGGTCGTTAG CTGGTGACGC TGCTCGTGGC 84 0 

TCGGCGGTTG CGGTGACCAG AAATACAGTG 900 

TCGTTTTCCA CGTCTTACGC AGTCAGTGGT 960 

CTCCCTAGGC ATTTGATGTT CTTCTTTGGC 1020 

ACCAATTCTT GGTCGCTTGG GGGCTATACG 108 0 

TGGGAAGGGA GGAGTTACAG ATCTTTATTG 114 0 

GTGCGCGGGT TGTTAAGTGA AACTGTGCCA 1200 

CGGAATGTTT ATGATCAGGC ACTAAATTTT 1260 

GGCCCATCCG GGACGGCTGG TCAAAGCGAT 1320 

GTTTTCTCTG ACGATGGTTT GCCCGCTAGT 1380 

GGTCCCAGCC ATGCTGTTGA TGAATCACCA 144 0 

CGTGTAGATG AACATCCGGC GTGTGGTGAA 1500 

CTTGGTACAG CTGTCTTAGA GTCGCCCCCC 1560 

GTCGAGGACG GTTGTCCGGA GGTTGAAGCT 1620 

CCTAGTTCAG AACCGCCGGT ACAAGAAGTC 1680 

AGAACTGAAG AGGTTGTGCA GGGCGACACA 1740 

AGTCAACGTG TGTTTCCTGC GCAAGTACCC IB 00 

GGCGCGGTCG TGGAGCCATT GCAAGTTTCT 1860 

TCTGTCGAGA AGGCGCGTGA GCTAAAGGCG 1920 

AAGGAAGTCA AGAATGTACC GGTTAAGACG 1980 

GATACCGTTC GTAAGGAATT GTGCATGTTT 204 0 

GTGTACAATG AAGCGACCAT CGCCACTAGG 2100 

TTGAAAGGGA GGAGTGCGGT CTTTTTCTCA 2160 
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AAGCTGGGTG AGGGGTATAC CTATAATGGT GGTAGCCATG TTTCATCAGG GTGGCCTCGT 2220 

GCCCTAGAGG ATATCTTAAC GGCAATTAAG TACCCAAGCG TCTTCGACCA CTGTTTAGTG 228 0 

CAGAAGTACA AGATGGGTGG AGGCGTACCA TTCCACGCTG ATGAGGAGGA GTGCTATCCA 2340 

TCAGATAACC CTATCTTGAC GGTCAATCTC GTGGGGAAGG CAAACTTCTC GACTAAGTGC 2400 

AGGAAGGGTG GTAAGGTCAT GGTCATAAAC GTAGCTTCGG GTGACTATTT TCTTATGCCT 2460 

TGCGGTTTTC AAAGGACGCA CTTGCATTCA GTAAACTCCA TCGACGAAGG GCGCATCAGT 2520 

TTGACGTTCA GGGCAACTCG GCGCGTCTTT GGTGTAGGCA GGATGTTGCA GTTAGCCGGC 2580 

GGCGTGTCGG ATGAGAAGTC ACCAGGTGTT CCAAACCAGC AACCACAGAG CCAAGGTGCT 2640 

ACCAGAACAA TCACACCAAA ATCGGGGGGC AAGGCTCTAT CTGAGGGAAG TGGTAGGGAA 2700 

GTCAAGGGGA GGTCGACATA CTCGATATGG TGCGAACAAG ATTACGTTAG GAAGTGTGAG 2760 

TGGCTCAGGG CTGATAATCC AGTGATGGCT CTTAAACCTG GCTACACCCC AATGACATTT 282 0 

GAAGTGGTTA AAGCCGGGAC CTCTGAAGAT GCCGTCGTGG AGTACTTGAA GTATCTGGCT 2860 

ATAGGCATTG GGAGGACATA CAGGGCGTTG CTTATGGCTA GAAATATTGC CGTCACTACC 2940 

GCCGAAGGTG TTCTGAAAGT ACCTAATCAA GTTTATGAAT CACTACCGGG CTTTCACGTT 3000 

TACAAGTCGG GCACAGATCT CATTTTTCAT TCAACACAAG ACGGCTTGCG TGTGAGAGAC 3060 

CTACCGTACG TATTCATAGC TGAGAAAGGT ATTTTTATCA AGGGCAAAGA TGTCGACGCG 3X20 

GTAGTAGCTT TGGGCGACAA TCTGTCCGTA TGTGATGATA TATTGGTTTT CCATGATGCT 3180 

ATTAATTTGA TGGGTGCACT GAAAGTTGCT CGATGTGGTA TGGTGGGTGA ATCATTTAAG 3240 

TCGTTCGAAT ACAAATGCTA TAATGCTCCC CCAGGTGGCG GTAAGACGAC GATGCTAGTG 33 00 

GACGAATTTG TCAAGTCACC CAATAGCACG GCCACCATTA CGGCTAACGT GGGAAGTTCT 3360 

GAGGACATAA ATATGGCGGT GAAGAAGAGA GATCCGAATT TGGAAGGTCT CAACAGTGCT 3420 

ACCACAGTTA ACTCCAGGGT GGTTAACTTT ATTGTCAGGG GAATGTATAA AAGGGTTTTG 34 60 

GTGGATGAGG TGTACATGAT GCATCAAGGC TTACTACAAC TAGGCGTCTT CGCAACCGGC 3540 
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GCGTCGGAAG GCCTCTTTTT TGGAGACATA AATCAGATAC CATTCATAAA CCGGGAGAAG 3600 

GTGTTTAGGA TGGATTGTGC TGTATTTGTT CCAAAGAAGG AAAGCGTTGT ATACACTTCT 3660 

AAATCATACA GGTGTCCGTT AGATGTTTGC TACTTGTTGT CCTCAATGAC CGTAAGGGGA 3720 

ACGGAAAAGT GTTACCCTGA AAAGGTCGTT AGCGGTAAGG ACAAACCAGT AGTAAGATCG 3780 

CTGTCCAAAA GGCCAATTGG AACCACTGAT GACGTAGCTG AAATAAACGC TGACGTGTAC 3840 

TTGTGCATGA CCCAGTTGGA GAAGTCGGAT ATGAAGAGGT CGTTGAAGGG AAAAGGAAAA 3 900 

GAAACACCAG TGATGACAGT GCATGAAGCA CAGGGAAAAA CATTCAGTGA TGTGGTATTG 3 960 

TTTAGGACGA AGAAAGCCGA TGACTCCCTA TTCACTAAAC AACCGCATAT ACTTGTTGGT 4020 

TTGTCGAGAC ACACACGCTC ACTGGTTTAT GCCGCTCTGA GCTCAGAGTT GGACGATAAG 4080 

GTCGGCACAT ATATTAGCGA CGCGTCGCCT CAATCAGTAT CCGACGCTTT GCTTCACACG 4140 
TTCGCCCCGG CTGGTTGCTT TCGAGGTATA TGA 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1390 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



4173 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Val Ser Thr Tyr Ala Lys Ser val Met Asn Asp Asn Phe Asn lie Leu 
! S 10 15 

Glu Thr Leu Val Thr Leu Pro Lys Ser Phe lie Val Lys Val Pro Gly 
20 25 30 
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Ser Val Leu Val Ser lie Thr Thr Ser Gly He Ser Asp Lys Leu Glu 
35 40 45 



10 



Leu Arg Gly Ala Phe Asp Val Ser Lys Lys Asn Phe Ser Arg Arg Leu 

SO 55 SO 

Arg Ser Ser Arg Leu Arg Val Phe Ser Arg Ala He Val Glu Asp Thr 

65 70 75 80 

Zle Lys Val Met Lys Gly Met Lys Ser Glu -Asp Gly Lys Pro Leu Pro 

85 90 95 
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He Ala Glu Asp Ser Val Tyr Ala Phe Met Thr Gly Asn Met Ser Asn 
100 105 110 

Val His Cys Thr Arg Ala Gly Leu Leu Gly Gly Ser Lys Ala Cys Ala 
115 120 125 



20 



25 



Ala Ser Leu Ala Val Lys Gly Ala Ala Ser Arg Ala Thr Gly Thr Lys 
130 135 140 

Leu Phe Ser Gly Leu Thr Ser Phe Leu Ser Ala Gly Gly Leu Phe Tyr 
145 150 1S5 160 

Asp Glu Gly Leu Thr Pro Gly Glu Arg Leu Asp Ala Leu Thr Arg Arg 
165 170 175 
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Glu His Ala Val Asn Ser Pro Val Gly Leu Leu Glu Pro Gly Ala Ser 

180 185 190 

Val Ala Lys Arg Val Val Ser Gly Thr Lys Ala Phe Leu Ser Glu Leu 

195 200 205 



35 



40 



Ser Leu Glu Asp Phe Thr Thr Phe Val He Lys Asn Arg Val Leu lie 
210 215 220 

Gly Val Phe Thr Leu Ser Met Ala Leu Thr Pro Val Val Trp Lys Tyr 
225 230 235 240 

Arg Arg Asn He Ala Arg Thr Gly Val Asp Val Phe His Arg Ala Arg 

245 250 255 
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Ser Gly Thr Ala Ala He Gly Leu Gin Cys Leu Ser Gly Gly Arg Ser 
260 26S 270 
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Leu Ala Gly Asp 



71 

Ala Ala Arg Gly Ala Leu Thr val Thr Arg Gly Gly 



280 



2B5 



Ala Val Ala Val Thr Arg Asn Thr val Ala Arg Arg Gin 



275 



Leu Ser Ser 
290 

val Pro Leu Ala Leu Leu Ser Phe Ser Thr Ser Tyr Ala Val Ser Gly 

no 315 320 

305 310 

Cys Thr Leu Leu Gly He Trp Ala His Ala Leu Pro Arg His Leu Met 
325 330 335 



Phe Phe Phe Gly Leu Gly Thr Leu Phe Gly Val Ser Ala Ser Thr Asn 

345 "0 

ser Trp Ser Leu Gly Gly Tyr Thr Asn Ser Leu Phe Thr Val Pro Glu 
355 360 365 

I*u Thr Trp Glu Gly Arg Ser Tyr Arg Ser Leu Leu Pro Gin Ala Ala 
370 

L eu Gly lie Ser Leu Val Val Arg Gly Leu Leu Ser Glu Thr Val Pro 
385 

Gin Leu Thr Tyr Val Pro Pro lie Glu Gly Arg Asn Val Tyr Asp Gin 

405 41U 



Ala Leu Asn Phe Tyr Arg Asp Phe Asp Tyr Asp Asp Gly Ala Gly Pro 
420 «5 430 

Ser Gly Thr Ala Gly Gin Ser Asp Pro Gly Thr Asn Thr Ser Asp Thr 



435 



440 



445 



Ser Ser 
450 



Val Phe Ser Asp Asp Gly Leu Pro Ala Ser Gly Gly Gly Phe 



455 



460 



Asp Ala Arg Val Glu Ala Gly Pro Ser His Ala Val Asp Glu Ser Pro 
465 

Arg Gly Ser Val Glu Phe Val Tyr Arg Glu Arg Val Asp Glu His Pro 
485 490 

Ala Cys Gly Glu Ala Glu Val Glu Lys Asp Leu lie Thr Pro Leu Gly 
500 505 510 
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Thr Ala Val Leu Glu Ser Pro Pro Val Gly Pro Glu Ala Gly Ser Ala 
515 520 525 

Pro Asn Val Glu Asp Gly Cys Pro Glu Val Glu Ala Glu Lys Cys Ser 
5 530 535 540 

Glu val lie Val Asp Val Pro Ser Ser Glu Pro Pro Val Gin Glu Val 
545 550 555 560 

10 Leu Glu Ser Thr Asn Gly Val Gin Ala Ala Arg Thr Glu Glu Val Val 

56S 570 575 

Gin Gly Asp Thr Cys Gly Ala Gly Val Ala Lys Ser Glu Val Ser Qln 
580 585 590 



15 



30 



Arg Val Phe Pro Ala Gin Val Pro Ala His Glu Ala Gly. Leu Glu Ala 
595 600 60S 



Ser Ser Gly Ala Val Val Glu Pro Leu Gin Val Ser Val Pro Val Ala 
20 610 615 620 

Val Glu Lys Thr Val Leu Ser Val Glu Lys Ala Arg Glu Leu Lys Ala 
625 630 635 640 

25 val Asp Lys Gly Lys Ala Val Val His Ala Lys Glu Val Lys Asn Val 

645 650 655 

Pro Val Lys Thr Leu Pro Arg Gly Ala Leu Lys He Ser Glu Asp Thr 
660 665 670 



Val Arg Lys Glu Leu Cys Met Phe Arg Thr Cys Ser Cys Gly Val Gin 
675 680 685 



Leu Asp Val Tyr Asn Glu Ala Thr He Ala Thr Arg Phe Ser Asn Ala 
35 690 695 700 

Phe Thr Phe Val Asp Ser Leu Lys Gly Arg Ser Ala Val Phe Phe Ser 
705 710 715 720 

40 Lys Leu Gly Glu Gly Tyr Thr Tyr Asn Gly Gly Ser His Val Ser Ser 

725 730 735 

Gly Trp Pro Arg Ala Leu Glu Asp He Leu Thr Ala He Lys Tyr Pro 
740 745 750 
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Ser Val Phe Asp His Cys Leu val Gin Lys Tyr Lys Met flly Gly Gly 
75S 760 765 

Val Pro Phe His Ala Asp Asp Olu Olu Cys Tyr Pro Ser Asp Asn Pro 

770 775 
He Leu Thr Val Asn Leu Val Gly Lys Ala Asn Phe Ser Thr Lys Cys 
78S 790 7 " 



Arg Lys Gly Gly Lys Val Met Val He Asn Val Ala Ser Gly Asp Tyr 
60S 810 815 

Phe Leu Met Pro Cys Gly Phe Gin Arg Thr His Leu His Ser Val Asn 
820 825 B3 ° 

Ser lie Asp Glu Gly Arg He Ser Leu Thr Phe Arg Ala Thr Arg Arg 
835 

Val Phe Gly Val Gly Arg Met Leu Gin Leu Ala Gly Gly Val Ser Asp 



850 



855 



860 



Glu Lys ser Pro Gly Val Pro Asn Gin Gin Pro Gin Ser Gin Gly Ala 

875 880 



865 870 



Thr Arg Thr lie Thr Pro Lys Ser Gly Gly Lys Ala Leu Ser Glu Gly 
88S 890 895 

Ser Gly Arg Glu Val Lys Gly Arg Ser Thr Tyr Ser He Trp Cys Glu 
900 905 910 

Gin Asp Tyr Val Arg Lys Cys Glu Trp Leu Arg Ala Asp Asn Pro Val 
915 920 925 

Met Ala Leu Lys Pro Gly Tyr Thr Pro Met Thr Phe Glu Val Val Lys 
930 

Ala Gly Thr Ser Glu Asp Ala Val Val Glu Tyr Leu Lys Tyr Leu Ala 
945 

lie Gly lie Gly Arg Thr Tyr Arg Ala Leu Leu Met Ala Arg Asn lie 
965 970 9 " 

Ala Val Thr Thr Ala Glu Gly Val Leu Lys Val Pro Asn Gin Val Tyr 
980 9 « 990 
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Glu Ser Leu Pro Gly Phe His Val Tyr Lys Ser Gly Thr Asp Leu lie 
995 1000 1005 

Phe His Ser Thr Gin Asp Gly Leu Arg Val Arg Asp Leu Pro Tyr Val 
5 1010 101S 1020 

Phe He Ala Glu Lys Gly He Phe He Lys Gly Lys Asp Val Asp Ala 
1025 1030 1035 1040 

10 Val val Ala Leu Gly Asp Asn Leu Ser Val Cys Asp Asp He Leu Val 

1045 1050 1055 

Phe His Asp Ala He Asn Leu Met Gly Ala Leu Lys Val Ala Arg Cys 
1060 1065 1070 



15 



Gly Met Val Gly Glu Ser Phe Lys Ser Phe Glu Tyr Lys Cys Tyr Asn 
1075 1080 1085 



Ala Pro Pro Gly Gly Gly Lys Thr Thr Met Leu Val Asp Glu Phe Val 
20 1090 1095 1100 

Lys Ser Pro Asn Ser Thr Ala Thr He Thr Ala Asn Val Gly Ser Ser 
1105 1110 1H5 1120 

25 Glu Asp He Asn Met Ala Val Lys Lys Arg Asp Pro Asn Leu Glu Gly 

1125 H30 1135 

Leu Asn Ser Ala Thr Thr Val Asn Ser Arg Val Val Asn Phe He Val 
1140 H45 1150 

30 

Arg Gly Met Tyr Lys Arg Val Leu Val Asp Glu Val Tyr Met Met His 
1155 H60 1165 

Gin Gly Leu Leu Gin Leu Gly Val Phe Ala Thr Gly Ala Ser Glu Gly 
35 1170 1175 1180 

Leu Phe Phe Gly Asp He Asn Gin He Pro Phe He Asn Arg Glu Lys 
1185 1190 H95 1200 

40 Val Phe Arg Met Asp Cys Ala Val Phe Val Pro Lys Lys Glu Ser Val 

1205 1210 1215 

Val Tyr Thr Ser Lys Ser Tyr Arg Cys Pro Leu Asp Val Cys Tyr Leu 
1220 1225 1230 

45 
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Leu Ser Ser Met Thr val Arg Gly Thr Glu Lys Cys Tyr Pro Glu Lys 
1235 1240 1245 

Val val Ser Gly Lys Asp Lys Pro Val val Arg Ser Leu Ser Lys Arg 
1250 1255 1260 

Pro He Gly Thr Thr Asp Asp Val Ala Glu He Asn Ala Asp Val Tyr 
1265 1270 1275 1280 

Leu Cys Met Thr Gin Leu Glu Lys Ser Asp Met Lys Arg Ser Leu Lys 
X285 1290 1295 

Gly Lys Gly Lys Glu Thr Pro Val Met Thr Val His Glu Ala Gin Gly 
1300 1305 1310 

Lys Thr Phe Ser Asp Val Val Leu Phe Arg Thr Lys Lys Ala Asp Asp 
X315 1320 1325 

Ser Leu Phe Thr Lys Gin Pro His He Leu Val Gly Leu Ser Arg His 
1330 1335 1340 

Thr Arg Ser Leu Val Tyr Ala Ala Leu Ser Ser Glu Leu Asp Asp Lys 



1345 



1350 1355 1360 



Val Gly Thr Tyr He Ser Asp Ala Ser Pro Gin Ser Val Ser Asp Ala 
1365 1370 1375 

Leu Leu His Thr Phe Ala Pro Ala Gly Cys Phe Arg Gly He 
13B0 1385 1390 



(2) INFORMATION FOR SEQ ID NO:3: 



( i ) S EQUENCE CHARACTERISTICS : 

(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



<iv) ANTI -SENSE: NO 



10 



WO 97/22700 PCT/US96/20747 

76 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGAATTTTG GACCGACCTT CGAAGGGGAG TTGGTACGGA AGATACCAAC AAGTCATTTT 60 

GTAGCCGTGA ATGGGTTTCT CGAGGACTTA CTCGACGGTT GTCCGGCTTT CGACTATGAC 120 

TTCTTTGAGG ATGATTTCGA AACTTCAGAT CAGTCTTTCC TCATAGAAGA TGTGCGCATT 18 0 

TCTGAATCTT TTTCTCATTT TGCGTCGAAA ATAGAGGATA GGTTTTACAG TTTTATTAGG 24 0 

TCTAGCGTAG GTTTACCAAA GCGCAACACC TTGAAGTGTA ACCTCGTCAC GTTTGAAAAT 300 

AGGAATTCCA ACGCCGATCG CGGTTGTAAC GTGGGTTGTG ACGACTCTGT GGCGCATGAA 360 

15 CTGAAGGAGA TTTTCTTCGA GGAGGTCGTT AACAAAGCTC GTTTAGCAGA GGTGACGGAA 420 

AGCCATTTGT CCAGCAACAC GATGTTGTTA TCAGATTGGT TGGACAAAAG GGCACCTAAC 4 80 

GCTTACAAGT CTCTCAAGCG GGCTTTAGGT TCGGTTGTCT TTCATCCGTC TATGTTGACG 54 0 

20 

TCTTATACGC TCATGGTGAA AGCAGACGTA AAACCCAAGT TGGACAATAC GCCATTGTCG 600 

AAGTACGTAA CGGGGCAGAA TATAGTCTAC CACGATAGGT GCGTAACTGC GCTTTTTTCT 66 0 

25 TGCATTTTTA CTGCGTGCGT AGAGCGCTTA AAATACGTAG TGGACGAAAG GTGGCTCTTC 720 

TACCACGGGA TGGACACTGC GGAGTTGGCG GCTGCATTGA GGAACAATTT GGGGGACATC 78 0 

CGGCAATACT ACACCTATGA ACTGGATATC AGTAAGTACG ACAAATCTCA GAGTGCTCTC 84 0 

30 

ATGAAGCAGG TGGAGGAGTT GATACTCTTG ACACTTGGTG TTGATAGAGA AGTTTTGTCT 900 

ACTTTCTTTT GTGGTGAGTA TGATAGCGTC GTGAGAACGA TGACGAAGGA ATTGGTGTTG 960 

35 TCTGTCGGCT CTCAGAGGCG CAGTGGTGGT GCTAACACGT GGTTGGGAAA TAGTTTAGTC 1020 

TTGTGCACCT TGTTGTCCGT AGTACTTAGG GGATTAGATT ATAGTTATAT TGTAGTTAGC 108 0 

GGTGATGATA GCCTTATATT TAGTCGGCAG CCGTTGGATA TTGATACGTC GGTTCTGAGC 114 0 

40 

GATAATTTTG GTTTTGACGT AAAGATTTTT AACCAAGCTG CTCCATATTT TTGTTCTAAG 1200 

TTTTTAGTTC AAGTCGAGGA TAGTCTCTTT TTTGTTCCCG ATCCACTTAA ACTCTTCGTT 1260 

4 5 AAGTTTGGAG CTTCCAAAAC TTCAGATATC GACCTTTTAC ATGAGATTTT TCAATCTTTC 13 20 
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GTCGATCTTT CGAAGGGTTT CAATAGAGAG GACGTCATCC AGGAATTAGC TAAGCTGGTG 138 0 

ACGCGGAAAT ATAAGCATTC GGGATGGACC TACTCGGCTT TGTGTGTCTT GCACGTTTTA 1440 

AGTGCAAATT TTTCGCAGTT CTGTAGGTTA TATTACCACA ATAGCGTGAA TCTCGATGTG 1500 

CGCCCTATTC AGAGGACCGA GTCGCTTTCC TTGCTGGCCT TGAAGGCAAG AATTTTAAGG 1560 

TGGAAAGCTT CTCGTTTTGC CTTTTCGATA AAGAGGGGTT AA 1602 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
* (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Asn Phe Gly Pro Thr Phe Glu Gly Glu Leu Val Arg Lye He Pro 
15 10 15 

Thr Ser His Phe Val Ala Val Asn Gly Phe Leu Glu Asp Leu Leu Asp 
20 25 30 

Gly Cys Pro Ala Phe Asp Tyr Asp Phe Phe Glu Asp Asp Phe Glu Thr 
35 40 45 

Ser Asp Gin Ser Phe Leu He Glu Asp Val Arg lie Ser Glu Ser Phe 
50 55 SO 

Ser His Phe Ala Ser Lys lie Glu Asp Arg Phe Tyr Ser Phe He Arg 
65 70 75 80 

Ser Ser Val Gly Leu Pro Lys Arg Asn Thr Leu Lys Cys Asn Leu Val 
85 90 95 



WO 97/22700 



PCT/US96/20747 



78 

Thr Phe Glu Asn Arg Asn Ser Asn Ala Asp Arg Gly Cys Asn Val Gly 
100 105 110 

Cys Asp Asp Ser Val Ala Hia Glu Leu Lys Glu He Phe Phe Glu Glu 
115 120 125 

Val Val Asn Lys Ala Arg Leu Ala Glu Val Thr Glu Ser His Leu Ser 
130 135 140 



10 



Ser Asn Thr Met Leu Leu- Ser Asp Trp Leu Asp Lys Arg Ala Pro Asn 
145 150 155 160 



15 



Ala Tyr Lys Ser Leu Lys Arg Ala Leu Gly Ser Val Val Phe His Pro 
165 170 175 

Ser Met Leu Thr Ser Tyr Thr Leu Met Val Lys Ala Asp Val Lys Pro 
180 185 190 



20 



Lys Leu Asp Asn Thr Pro Leu Ser Lys Tyr Val Thr Gly Gin Asn He 
195 200 205 



Val Tyr His Asp Arg Cys Val Thr Ala Leu Phe Ser Cys He Phe Thr 
210 215 220 



25 



Ala Cys Val Glu Arg Leu Lys Tyr Val Val Asp Glu Arg Trp Leu Phe 
225 230 235 240 



30 



Tyr His Gly Met Asp Thr Ala Glu Leu Ala Ala Ala Leu Arg Asn Asn 
245 250 255 

Leu Gly Asp He Arg Gin Tyr Tyr Thr Tyr Glu Leu Asp He Ser Lys 
260 265 270 



35 



Tyr Asp Lys Ser Gin Ser Ala Leu Met Lys Gin Val Glu Glu Leu He 
275 280 285 



Leu Leu Thr Leu Gly Val Asp Arg Glu Val Leu Ser Thr Phe Phe Cys 
290 295 300 



40 



Gly Glu Tyr Asp Ser Val Val Arg Thr Met Thr Lys Glu Leu Val Leu 
305 310 315 320 



45 



Ser Val Gly Ser Gin Arg Arg Ser Gly Gly Ala Asn Thr Trp Leu Gly 
325 330 335 
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Asn Ser Leu Val Leu Cys Thr Leu Leu Ser Val Val Leu Arg Gly Leu 
340 345 350 

Asp Tyr Ser Tyr He Val Val Ser Gly Asp Asp Ser Leu He Phe Ser 
5 355 360 365 

Arg Gin Pro Leu Asp He Asp Thr Ser Val Leu Ser Asp Asn Phe Gly 
370 375 380 

10 Phe Asp Val Lys He Phe Asn Gin Ala Ala Pro Tyr Phe Cys Ser Lys 

385 390 395 400 

Phe Leu Val Gin Val Glu Asp Ser Leu Phe Phe Val Pro Asp Pro Leu 
405 410 415 



15 



Lys Leu Phe Val Lys Phe Gly Ala Ser Lys Thr Ser Asp He Asp Leu 
420 425 430 



Leu His Glu lie Phe Gin Ser Phe Val Asp Leu Ser Lys Gly Phe Asn 
20 435 440 445 

Arg Glu Asp Val He Gin Glu Leu Ala Lys Leu Val Thr Arg Lys Tyr 
450 45S 460 

25 Lys His Ser Gly Trp Thr Tyr Ser Ala Leu Cys Val Leu His Val Leu 

465 470 475 480 

Ser Ala Asn Phe Ser Gin Phe Cys Arg Leu Tyr Tyr His Asn Ser Val 
465 490 495 

30 

Asn Leu Asp Val Arg Pro He Gin Arg Thr Glu Ser Leu Ser Leu Leu 
500 505 510 

Ala Leu Lys Ala Arg He Leu Arg Trp Lys Ala Ser Arg Phe Ala Phe 
35 515 520 525 

Ser He Lys Arg Gly 
530 

4 0 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : double 
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(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: CDNA 
5 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGAAGTAG GTATAGATTT TGGAACCACT TTCAGCACAA TCTGCTTTTC CCCATCTGGG 60 

15 

GTCAGCGGTT GTACTCCTGT GGCCGGTAGT GTTTACGTTG AAACCCAAAT TTTTATACCT 120 
GAAGGTAGCA GTACTTACTT AATTGGTAAA GCTGCGGGGA AAGCTTATCG TGACGGTGTA 180 
20 GAGGGAAGGT TGTATGTTAA CCCGAAAAGG TGGGCAGGTG TGACGAGGGA TAACGTCGAA 24 0 

CGCTACGTCG AGAAATTAAA ACCTACATAC ACCGTGAAGA TAGACAGCGG AGGCGCCTTA 300 
TTAATTGGAG GTTTAGGTTC CGGACCAGAC ACCTTATTGA GGGTCGTTGA CGTAATATGT 360 

25 

TTATTCTTGA GAGCCTTGAT ACTGGAGTGC GAAAGGTATA CGTCTACGAC GGTTACAGCA 420 
GCTGTTGTAA CGGTACCGGC TGACTATAAC TCCTTTAAAC GAAGCTTCGT TGTTGAGGCG 4 BO 

3 0 CTAAAAGGTC TTGGTATACC GGTTAGAGGT GTTGTTAACG AACCGACGGC CGCAGCCCTC 540 

TATTCCTTAG CTAAGTCGCG AGTAGAAGAC CTATTATTAG CGGTTTTTGA TTTTGGGGGA 600 
GGGACTTTCG ACGTCTCATT CGTTAAGAAG AAGGGAAATA TACTATGCGT CATCTTTTCA 66 0 

35 

GTGGGTGATA ATTTCTTGGG TGGTAGAGAT ATTGATAGAG CTATCGTGGA AGTTATCAAA 720 
CAAAAGATCA AAGGAAAGGC GTCTGATGCC AAGTTAGGGA TATTCGTATC CTCGATGAAG 780 

4 0 GAAGACTTGT CTAACAATAA CGCTATAACG CAACACCTTA TCCCCGTAGA AGGGGGTGTG 84 0 

GAGGTTGTGG ATTTGACTAG CGACGAACTG GACGCAATCG TTGCACCATT CAGCGCTAGG 900 
GCTGTGGAAG TATTCAAAAC TGGTCTTGAC AACTTTTACC CAGACCCGGT TATTGCCGTT 960 

45 
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ATGACTGGGG GGTCAAGTGC TCTAGTTAAG GTCAGGAGTG ATGTGGCTAA TTTGCCGCAG 1020 
ATATCTAAAG TCGTGTTCGA CAGTACCGAT TTTAGATGTT CGGTGGCTTG TGGGGCTAAG 



1080 



GTTTACTGCG ATACTTTGGC AGGTAATAGC GGACTGAGAC TGGTGGACAC TTTAACGAAT 1140 
ACGCTAACGG ACGAGGTAGT GGGTCTTCAG CCGGTGGTAA TTTTCCCGAA AGGTAGTCCA 1200 
ATACCCTGTT CATATACTCA TAGATACACA GTGGGTGGTG GAGATGTGGT ATACGGTATA 1260 
TTTGAAGGGG AGAATAACAG AGCTTTTCTA AATGAGCCGA CGTTCCGGGG CGTATCGAAA 1320 
CGTAGGGGAG ACCCAGTAGA GACCGACGTG GCGCAGTTTA ATCTCTCCAC GGACGGAACG 1380 
GTGTCTGTTA TCGTTAATGG TGAGGAAGTA AAGAATGAAT ATCTGGTACC CGGGACAACA 1440 
AACGTACTGG ATTCATTGGT CTATAAATCT GGGAGAGAAG ATTTAGAGGC TAAGGCAATA 1500 
CCAGAGTACT TGACCACACT GAATATTTTG CACGATAAGG CTTTCACGAG GAGAAACCTG 
GGTAACAAAG ATAAGGGGTT CTCGGATTTA AGGATAGAAG AAAATTTTTT AAAATCCGCC 
GTAGATACAG ACACGATTTT GAATGGATAA 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 9 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



1560 



1620 



1650 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Glu val Gly He Asp Phe Gly Thr Thr Phe Ser Thr lie Cys Phe 

10 15 
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Ser Pro Ser Gly Val Ser Gly Cys Thr Pro Val Ala Gly Ser Val Tyr 
20 25 30 

Val Glu Thr Gin lie Phe He Pro Glu Gly Ser Ser Thr Tyr Leu He 
35 40 45 

Gly Lys Ala Ala Gly Lys Ala Tyr Arg Asp Gly Val Glu Gly Arg Leu 
SO 55 60 



10 



Tyr Val Asn Pro Lys Arg Trp Ala Gly Val Thr Arg Asp Asn Val Glu 
65 70 75 80 



15 



Arg Tyr Val Glu Lys Leu LyE Pro Thr Tyr Thr Val Lys He Asp Ser 
85 90 95 

Gly Gly Ala Leu Leu He Gly Gly Leu Gly Ser Gly Pro Asp Thr Leu 
100 105 110 



20 



Leu Arg Val Val Asp Val He Cys Leu Phe Leu Arg Ala Leu He Leu 
115 120 125 



Glu Cys Glu Arg Tyr Thr Ser Thr Thr Val Thr Ala Ala Val Val Thr 
130 135 140 



25 



Val Pro Ala Asp Tyr Asn Ser Phe Lys Arg Ser Phe Val Val Glu Ala 
145 150 155 160 



30 



Leu Lys Gly Leu Gly He Pro Val Arg Gly Val Val Asn Glu Pro Thr 

165 170 175 

Ala Ala Ala Leu Tyr Ser Leu Ala Lys Ser Arg Val Glu Asp Leu Leu 
180 185 190 



35 



Leu Ala Val Phe Asp Phe Gly Gly Gly Thr Phe Asp Val Ser Phe Val 
195 200 205 



Lys Lys Lys Gly Asn He Leu Cys Val He Phe Ser Val Gly Asp Asn 
210 215 220 



40 



Phe Leu Gly Gly Arg Asp lie Asp Arg Ala He Val Glu Val He Lys 
225 230 235 240 



45 



Gin Lys He Lys Gly Lys Ala Ser Asp Ala Lys Leu Gly He Phe Val 
245 250 255 
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Ser Ser Met Lys Glu Asp Leu Ser Asn Asn Asn Ala He Thr Gin His 
260 265 270 

Leu lie Pro Val Glu Gly Gly Val Glu Val Val Asp Leu Thr Ser Asp 
5 275 280 285 

Glu Leu Asp Ala He Val Ala Pro Phe Ser Ala Arg Ala Val Glu Val 
290 295 300 

10 Phe Lys Thr Gly Leu Asp Asn Phe Tyr Pro Asp Pro Val He Ala Val 

305 310 315 320 

Met Thr Gly Gly Ser Ser Ala Leu Val Lys Val Arg Ser Asp Val Ala 
325 330 335 



15 



30 



Asn Leu Pro Gin He Ser Lys Val Val Phe Asp Ser Thr Asp Phe Arg 
340 345 350 



Cys Ser Val Ala Cys Gly Ala Lys Val Tyr Cys Asp Thr Leu Ala Gly 
20 355 360 365 

Asn Ser Gly Leu Arg Leu Val Asp Thr Leu Thr Asn Thr Leu Thr Asp 
370 375 380 

25 Glu Val Val Gly Leu Gin Pro Val Val He Phe Pro Lys Gly Ser Pro 

385 390 395 400 

He Pro Cys Ser Tyr Thr His Arg Tyr Thr Val Gly Gly Gly Asp Val 
405 410 415 



Val Tyr Gly He Phe Glu Gly Glu Asn Asn Arg Ala Phe Leu Asn Glu 
420 425 430 



Pro Thr Phe Arg Gly Val Ser Lys Arg Arg Gly Asp Pro Val Glu Thr 
35 435 440 445 

Asp Val Ala Gin Phe Asn Leu Ser Thr Asp Gly Thr Val Ser Val He 
450 455 460 

4 0 Val Asn Gly Glu Glu Val Lys Asn Glu Tyr Leu Val Pro Gly Thr Thr 

465 470 475 480 



45 



Asn Val Leu Asp Ser Leu Val Tyr Lys Ser Gly Arg Glu Asp Leu Glu 
4B5 490 495 
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Ala Lys Ala lie Pro Glu Tyr Leu Thr Thr Leu Asn He Leu His Asp 
500 S05 510 

Lys Ala Phe Thr Arg Arg Asn Leu Gly Asn Lys Asp Lys Gly Phe Ser 
5 515 520 525 

Asp Leu Arg He Glu Glu Asn Phe Leu Lys Ser Ala Val Asp Thr Asp 
530 535 540 

10 Thr He Leu Asn Gly 

545 

(2) INFORMATION FOR SEQ ID NO: 7: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

20 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGATAAAT ATATTTATGT AACGGGGATA TTAAACCCTA ACGAGGCTAG AGACGAGGTA 
TTCTCGGTAG TGAATAAGGG ATATATTGGA CCGGGAGGGC GCTCCTTTTC GAATCGTGGT 

35 

AGTAAGTACA CCGTCGTCTG GGAAAACTCT GCTGCGAGGA TTAGTGGATT TACGTCGACT 
TCGCAATCTA CGATAGATGC TTTCGCGTAT TTCTTGTTGA AAGGCGGATT GACTACCACG 
4 0 CTCTCTAACC CAATAAACTG TGAGAATTGG GTCAGGTCAT CTAAGGATTT AAGCGCGTTT 

TTCAGGACCC TAATTAAAGG TAAGATTTAT GCATCGCGTT CTGTGGACAG CAATCTTCCA 
AAGAAAGACA GGGATGACAT CATGGAAGCG AGTCGACGAC TATCGCCATC GGACGCCGCC 

45 



10 
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TTTTGCAGAG CAGTGTCGGT TCAGGTAGGG AAGTATGTGG ACGTAACGCA GAATTTAGAA 4 BO 

AGTACGATCG TGCCGTTAAG AGTTATGGAA ATAAAGAAAA GACGAGGATC AGCACATGTT 540 

AGTTTACCGA AGGTGGTATC CGCTTACGTA GATTTTTATA CGAACTTGCA GGAATTGCTG 600 

TCGGATGAAG TAACTAGGGC CAGAACCGAT ACAGTTTCGG CATACGCTAC CGACTCTATG 660 

GCTTTCTTAG TTAAGATGTT ACCCCTGACT GCTCGTGAGC AGTGGTTAAA AGACGTGCTA 720 

GGATATCTGC TGGTACGGAG ACGACCAGCA AATTTTTCCT ACGACGTAAG AGTAGCTTGG 780 

GTATATGACG TGATCGCTAC GCTCAAGCTG GTCATAAGAT TGTTTTTCAA CAAGGACACA 840 

15 CCCGGGGGTA TTAAAGACTT AAAACCGTGT GTGCCTATAG AGTCATTCGA CCCCTTTCAC 900 

GAGCTTTCGT CCTATTTCTC TAGGTTAAGT TACGAGATGA CGACAGGTAA AGGGGGAAAG 960 

ATATGCCCGG AGATCGCCGA GAAGTTGGTG CGCCGTCTAA TGGAGGAAAA CTATAAGTTA 102 0 

20 

AGATTGACCC CAGTGATGGC CTTAATAATT ATACTGGTAT ACTACTCCAT TTACGGCACA 1080 

AACGCTACCA GGATTAAAAG ACGCCCGGAT TTCCTCAATG TGAGGATAAA GGGAAGAGTC 1140 

25 GAGAAGGTTT CGTTACGGGG GGTAGAAGAT CGTGCCTTTA GAATATCAGA AAAGCGCGGG 1200 

ATAAACGCTC AACGTGTATT ATGTAGGTAC TATAGCGATC TCACATGTCT GGCTAGGCGA 1260 

CATTACGGCA TTCGCAGGAA CAATTGGAAG ACGCTGAGTT ATGTAGACGG GACGTTAGCG 1320 

30 

TATGACACGG CTGATTGTAT AACTTCTAAG GTGAGAAATA CGATCAACAC CGCAGATCAC 1380 

GCTAGCATTA TACACTATAT CAAGACGAAC GAAAACCAGG TTACCGGAAC TACTCTACCA 1440 

35 CACCAGCTTT AA 1452 
U) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 4 83 amino acids 

(B) TYPE: amino acid 
(C> STRAND EDNESS : not relevant 
(D) TOPOLOGY: not relevant 

45 (ii) MOLECULE TYPE: protein 
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86 



PCT/US96/20747 



10 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Lys Tyr He Tyr Val Thr Gly He Leu Asn Pro Asn Glu Ala 
1 5 10 15 

Arg Asp Glu Val Phe Ser Val Val Asa Lys Gly Tyr He Gly Pro Gly 
20 25 30 



Gly Arg Ser Phe Ser Asn Arg Gly Ser Lys Tyr Thr Val Val Trp Glu 
15 35 40 45 

Asn Ser Ala Ala Arg He Ser Gly Phe Thr Ser Thr Ser Gin Ser Thr 
50 55 60 

20 He Asp Ala Phe Ala Tyr Phe Leu Leu Lys Gly Gly Leu Thr Thr Thr 

65 70 7S 80 

Leu Ser Asn Pro He Asn Cys Glu Asn Trp Val Arg Ser Ser Lys Asp 
AS 90 95 



Leu Ser Ala Phe Phe Arg Thr Leu He Lys Gly Lys He Tyr Ala Ser 
100 105 HO 



Arg Ser Val Asp Ser Asn Leu Pro Lys Lys Asp Arg Asp Asp He Met 
30 H5 I 20 125 

Glu Ala Ser Arg Arg Leu Ser Pro Ser Asp Ala Ala Phe Cys Arg Ala 
130 13S 140 

35 val Ser Val Gin Val Gly Lys Tyr Val Asp Val Thr Gin Asn Leu Glu 

145 150 1S5 160 

Ser Thr He Val Pro Leu Arg Val Met Glu He Lys Lys Arg Arg Gly 
165 170 175 

40 

Ser Ala His Val Ser Leu Pro Lys Val Val Ser Ala Tyr Val Asp Phe 
180 185 190 

Tyr Thr Asn Leu Gin Glu Leu Leu Ser Asp Glu Val Thr Arg Ala Arg 
45 195 200 205 
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Thr Asp Thr Val Ser Ala Tyr Ala Thr Asp Ser Met Ala Phe Leu Val 



Asp 

210 



21S 220 



Lya Met Leu Pro Leu Thr Ala Arg Glu Gin Trp Leu Lye Asp Val Leu 
225 230 23S 240 

Gly Tyr Leu Leu Val Arg Arg Arg Pro Ala Asn Phe Ser Tyr Asp Val 
245 250 255 

Arg Val Ala Trp Val Tyr Asp Val He Ala Thr Leu Lys Leu Val He 
260 265 270 

Arg Leu Phe Phe Asn Lys Asp Thr Pro Gly Gly He Lys Asp Leu Lys 
275 280 285 

Pro Cys Val Pro He Glu Ser Phe Asp Pro Phe His Glu Leu Ser Ser 
. 290 295 300 

Tyr Phe Ser Arg Leu Ser Tyr Glu Met Thr Thr Gly Lys Gly Gly Lys 



305 



310 315 320 



He Cys Pro Glu He Ala Glu Lys Leu Val Arg Arg Leu Met Glu Glu 
32 s 330 335 

Asn Tyr Lys Leu Arg Leu Thr Pro Val Met Ala Leu He He He Leu 
340 3« 350 

Val Tyr Tyr Ser He Tyr Gly Thr Asn Ala Thr Arg He Lys Arg Arg 



355 



360 



365 



Pro Asp Phe Leu Asn Val Arg He Lys Gly Arg Val Glu Lys Val Ser 
370 375 380 

Leu Arg Gly Val Glu Asp Arg Ala Phe Arg He Ser Glu Lys Arg Gly 
385 390 395 400 

He Asn Ala Gin Arg Val Leu cys Arg Tyr Tyr Ser Asp Leu Thr Cys 
405 410 415 

Leu Ala Arg Arg His Tyr Gly He Arg Arg Asn Asn Trp Lys Thr Leu 
420 «5 «0 



Ser Tyr Val Asp Gly Thr Leu Ala Tyr Asp Thr Ala Asp Cys He Thr 
435 



440 44S 
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Ser Lys Val Arg Asn Thr He Asn Thr Ala Asp His Ala Ser He He 
450 455 460 

His Tyr He Lys Thr Asn Glu Asn Gin Val Thr Gly Thr Thr Leu Pro 
465 470 475 480 

His Gin Leu 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

<iv> ANT I -SENSE: NO 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
ATGGCATTTG AACTGAAATT AGGGCAGATA TATGAAGTCG TCCCCGAAAA TAATTTGAGA 60 
GTTAGAGTGG GGGATGCGGC ACAAGGAAAA TTTAGTAAGG CGAGTTTCTT AAAGTACGTT 120 
AAGGACGGGA CACAGGCGGA ATTAACGGGA ATCGCCGTAG TGCCCGAAAA ATACGTATTC 180 
GCCACAGCAG CTTTGGCTAC AGCGGCGCAG GAGCCACCTA GGCAGCCACC AGCGCAAGTG 240 
GCGGAACCAC AGGAAACCGA TATAGGGGTA GTGCCGGAAT CTGAGACTCT CACACCAAAT 300 
AAGTTGGTTT TCGAGAAAGA TCCAGACAAG TTCTTGAAGA CTATGGGCAA GGGAATAGCT 
TTGGACTTGG CGGGAGTTAC CCACAAACCG AAAGTTATTA ACGAGCCAGG GAAAGTATCA 
GTAGAGGTGG CAATGAAGAT TAATGCCGCA TTGATGGAGC TGTGTAAGAA GGTTATGGGC 
GCCGATGACG CAGCAACTAA GACAGAATTC TTCTTGTACG TGATGCAGAT TGCTTGCACG 54 0 

TTCTTTACAT CGTCTTCGAC GGAGTTCAAA GAGTTTGACT ACATAGAAAC CGATGATGGA 600 



360 



420 



480 
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AAGAAGATAT ATGCGGTGTG GGTATATGAT TGCATTAAAC AAGCTGCTGC TTCGACGGGT 660 

TATGAAAACC CGGTAAGGCA GTATCTAGCG TACTTCACAC CAACCTTCAT CACGGCGACC 720 

CTGAATGGTA AACTAGTGAT GAACGAGAAG GTTATGGCAC AGCATGGAGT ACCACCGAAA 780 

TTCTTTCCGT ACACGATAGA CTGCGTTCGT CCGACGTACG ATCTGTTCAA CAACGACGCA 84 0 

ATATTAGCAT GGAATTTAGC TAGACAGCAG GCGTTTAGAA ACAAGACGGT AACGGCCGAT 900. 

AACACCTTAC ACAACGTCTT CCAACTATTG CAAAAGAAGT AG 942 
(2) INFORMATION FOR SEQ ID NO: 10: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



10 



20 



25 



35 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

30 Met Ala Phe Glu Leu Lys Leu Gly Gin He Tyr Glu Val Val Pro Glu 

15 10 IS 

Asn Asn Leu Arg Val Arg Val Gly Asp Ala Ala Gin Gly Lys Phe Ser 
20 25 30 



Lys Ala Ser Phe Leu Lys Tyr Val Lys Asp Gly Thr Gin Ala Glu Leu 
35 40 45 



Thr Gly He Ala Val Val Pro Glu Lys Tyr Val Phe Ala Thr Ala Ala 
40 50 55 60 

Leu Ala Thr Ala Ala Gin Glu Pro Pro Arg Gin Pro Pro Ala Gin Val 
65 70 75 80 



45 
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Ala Glu Pro Gin Glu Thr Asp lie Gly Val Val Pro Glu Ser Glu Thr 
85 90 95 

Leu Thr Pro Asn Lys Leu Val Phe Glu Lys Asp Pro Asp Lys Phe Leu 
5 100 105 110 

Lys Thr Met Gly Lys Gly lie Ala Leu Asp Leu Ala Gly Val Thr His 
115 120 125 

10 Lys Pro Lys Val He Asn Glu Pro Gly Lys Val Ser Val Glu Val Ala 

130 135 140 

Met Lys He Asn Ala Ala Leu Met Glu Leu Cys Lys Lys Val Met Gly 
145 ISO 155 igo 



15 



30 



Ala Asp Asp Ala Ala Thr Lys Thr Glu Phe Phe Leu Tyr Val Met Gin 
165 170 175 



He Ala Cys Thr Phe Phe Thr Ser Ser Ser Thr Glu Phe Lyc Glu Phe 
20 180 185 190 

Asp Tyr He Glu Thr Asp Asp Gly Lys Lys He Tyr Ala Val Trp Val 
195 200 205 

25 Tyr Asp Cys He Lys Gin Ala Ala Ala Ser Thr Gly Tyr Glu Asn Pro 

210 215 220 

Val Arg Gin Tyr Leu Ala Tyr Phe Thr Pro Thr Phe He Thr Ala Thr 
225 230 235 240 



Leu Asn Gly Lys Leu Val Met Asn Glu Lys Val Met Ala Gin His Gly 
245 250 255 



Val Pro Pro Lys Phe Phe Pro Tyr Thr He Asp Cys Val Arg Pro Thr 

35 260 265 270 

Tyr Asp Leu Phe Asn Asn Asp Ala He Leu Ala Trp Asn Leu Ala Arg 
275 280 285 

4 0 Gin Gin Ala Phe Arg Asn Lys Thr Val Thr Ala Asp Asn Thr Leu His 

290 295 300 



45 



Asn Val Phe Gin Leu Leu Gin Lys Lys 
305 310 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 
5 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : double 
(D) TOPOLOGY: not relevant 



10 



15 



25 



35 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
20 ATGTACAGTA GAGGGTCTTT CTTTAAGTCT CGGGTTACCC TTCCTACTCT TGTCGGAGCA 60 

TACATGTGGG AGTTTGAACT CCCGTATCTT ACGGACAAGA GACACATCAG CTATAGCGCG 120 
CCAAGTGTCG CGACTTTTAG CCTTGTGTCG AGGTAG 156 
(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Tyr Ser Arg Gly Ser Phe Phe Lys Ser Arg Val Thr Leu Pro Thr 
15 10 15 



45 
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Leu Val Gly Ala Tyr Met Trp Glu Phe Glu Leu Pro Tyr Leu Thr Aap 
20 25 30 

Lys Arg His lie Ser Tyr Ser Ala Pro Ser Val Ala Thr Phe Ser Leu 
35 40 45 

Val Ser Arg 
50 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGGATGATT TTAAACAGGC AATACTGTTG CTAGTAGTCG ATTTTGTCTT CGTGATAATT 60 
CTGCTGCTGG TTCTTACGTT CGTCGTCCCG AGGTTACAGC AAAGCTCCAC CATTAATACA 120 
GGTCTTAGGA CAGTGTGA 138 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
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{xi} SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Met Asp Asp Phe Lys Gin Ala lie Leu Leu Leu Val Val Asp Phe Val 
5 15 10 15 

Phe Val He lie Leu Leu Leu Val Leu Thr Phe Val Val Pro Arg Leu 
20 25 30 

10 Gin Gin Ser Ser Thr He Asn Thr Gly Leu Arg Thr Val 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 15: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; not relevant 

20 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGGGAGCTT ATACACATGT AGACTTTCAT GAGTCGCGGT TGCTGAAAGA CAAACAAGAC 60 

TATCTTTCTT TCAAGTCAGC GGATGAAGCT CCTCCTGATC CTCCCGGATA CGTTCGCCCA 120 

35 

GATAGTTATG TGAGGGCTTA TTTGATACAA AGAGCAGACT TTCCCAATAC TCAAAGCTTA 18 0 

TCAGTTACGT TATCGATAGC CAGTAATAAG TTAGCTTCAG GTCTTATGGG AAGCGACGCA 24 0 

40 GTATCATCGT CGTTTATGCT GATGAACGAC GTGGGAGATT ACTTCGAGTG CGGCGTGTGT 300 

CACAACAAAC CCTACTTAGG ACGGGAAGTT ATCTTCTGTA GGAAATACAT AGGTGGGAGA 360 

GGAGTGGAGA TCACCACTGG TAAGAACTAC ACGTCGAACA ATTGGAACGA GGCGTCGTAC 420 



45 
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GTAATACAAG TGAACGTAGT CGATGGGTTA GCACAGACCA 
CAAACGGACG TTAGTGGTCT ACCCAAAAAT TGGACGCGTA 
5 GTGTCCGTAG ATCAGAACCT CTACCCTGGT TGTTTCTCAG 

CGTATAAGGT CACTGTTAGT TTCCCCAGTG CGCATCTTCT 
CCTTTGAAGA AATCGTTCAA CGCAAGAATC GAGGATGTGC 

10 

TTGTTAGTAC CGAGTCCTGT CGTACCAGAG TCTACGGGAG 
CTGGATGTAG TGGCTTTAAC GTCOGACGTA ACGGAATTGA 
15 AAGATATGTT TTCCAGACTC AGTGTTATCG ATCAATGAAG 

TATTTGCCGA TAACGGAAGC TCTACAGATA AACGCAAGAC 
AAAGGCGGGA GTCAAACACC ACGAGATATG GGGAATATGA 

20 

TTCGTACTCT ACTCTACTGT AAAGAATATA AGCGTCAAAG 
GAATTAGGTC AAAAGAGAGT CTACTTAAGT TATTCGGAAG 
25 GGGAAATACG GTGCGTCTCC AACCAACACT GTGCGATCCT 

ACCACTATTA CTCTACTTAT AGAGAAGAAA ATTCAGCCAG 
CACGGCGTCC CGAAGAGGTT CACTCCGTAC TGCTTCGACT 

30 

TATTACCCGG CGGACGTGTT GAAGGCTAAC GCAATGGCTT 
GCTAATTTAA GGCGTAAAGG TTCGGAGACG TATAACATCT 
3 5 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 amino acids 

(B) -TYPE : amino acid 

40 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 



PCT/US96/20747 

CTGTTAATTC TACTTATACG 480 

TCTACAAAAT AACAAAGATA 540 

ACTCGAAACT GGGTGTAATG 600 

TTAGGGATAT CTTATTGAAA 660 

TGAATATTGA CGACACGTCG 720 

GTGTAGGTCC ATCAGAGCAG 780 

TCAACACTAG GGGGCAAGGT 84 0 

CGGATATCTA CGATGAGCGG 900 

TACGCAGACT CGTTCTTTCG 960 

TAGTGGCCAT GATACAACTT 1020 

ACGGGTATAG GGTGGAGACC 108 0 

TAAGGGAAGC TATATTAGGA 1140 

TCATGAGGTA TTTTGCTCAC 1200 

CGTGTACTGC CCTAGCTAAG 1260 

TCG CACTACT GGATAACAGA 1320 

GCGCTATAGC GATTAAATCA 138 0 

TAGAAAGCAT TTGA 1434 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gly Ala Tyr Thr His Val Asp Phe His Glu Ser Arg Leu Leu Lys 
5 1 S 10 15 

Asp Lys Gin Asp Tyr Leu Ser Phe Lys Ser Ala Asp Glu Ala Pro Pro 
20 25 30 

10 Asp Pro Pro Gly Tyr Val Arg Pro Asp Ser Tyr Val Arg Ala Tyr Leu 

35 40 45 

He Gin Arg Ala Asp Phe Pro Asn Thr Gin Ser Leu Ser Val Thr Leu 
50 55 60 



15 



30 



Ser He Ala Ser Asn Lys Leu Ala Ser Gly Leu Met Gly Ser Asp Ala 
65 70 75 80 



Val Ser Ser Ser Phe Met Leu Met Asn Asp Val Gly Asp Tyr Phe Glu 
20 B5 90 95 

Cys Gly Val Cys His Asn Lys Pro Tyr Leu Gly Arg Glu Val He Phe 
100 105 HO 

25 Cys Arg Lys Tyr He Gly Gly Arg Gly Val Glu He Thr Thr Gly Lys 

US 120 125 

Asn Tyr Thr Ser Asn Asn Trp Asn Glu Ala Ser Tyr Val He Gin Val 
130 135 140 



Asn Val Val Asp Gly Leu Ala Gin Thr Thr Val Asn Ser Thr Tyr Thr 
145 ISO 155 160 



Gin Thr Asp Val Ser Gly Leu Pro Lys Asn Trp Thr Arg He Tyr Lys 
35 1G5 170 175 

He Thr Lys He Val Ser Val Asp Gin Asn Leu Tyr Pro Gly Cys Phe 
180 185 190 

40 Ser Asp Ser Lys Leu Gly Val Met Arg He Arg Ser Leu Leu Val Ser 

195 200 205 

Pro Val Arg He Phe Phe Arg Asp He Leu Leu Lys Pro Leu Lys Lys 
210 215 22 0 

45 



WO 97/22700 PCT/US 96/2 0747 

96 

Ser Phe Asn Ala Arg He Glu Asp Val Leu Asn He Asp Asp Thr Ser 
225 230 235 240 

Leu Leu Val Pro Ser Pro Val Val Pro Glu Ser Thr Gly Gly Val Gly 
5 245 250 255 

Pro Ser Glu Gin Leu Asp Val Val Ala Leu Thr Ser Asp Val Thr Glu 
260 265 270 

10 Leu He Asn Thr Arg Gly Gin Gly Lys He Cys Phe Pro Asp Ser Val 

27S 280 285 

Leu Ser lie Asn Glu Ala Asp He Tyr Asp Glu Arg Tyr Leu Pro He 
290 295 300 



15 



30 



Thr Glu Ala Leu Gin He Asn Ala Arg Leu Arg Arg Leu Val Leu Ser 
305 310 315 320 



Lys Gly Gly Ser Gin Thr Pro Arg Asp Met Gly Asn Met He Val Ala 
20 325 330 335 

Met He Gin Leu Phe Val Leu Tyr Ser Thr Val Lys Asn He Ser Val 
340 345 350 

25 Lys Asp Gly Tyr Arg Val Glu Thr Glu Leu Gly Gin Lys Arg Val Tyr 

355 360 36S 

Leu Ser Tyr Ser Glu Val Arg Glu Ala He Leu Gly Gly Lys Tyr Gly 
370 375 380 



Ala Ser Pro Thr Asn Thr Val Arg Ser Phe Met Arg Tyr Phe Ala His 
385 390 395 400 



Thr Thr lie Thr Leu Leu He Glu Lys Lys He Gin Pro Ala Cys Thr 
35 405 410 415 

Ala Leu Ala Lys His Gly Val Pro Lys Arg Phe Thr Pro Tyr Cys Phe 
420 425 430 

40 Asp Phe Ala Leu Leu Asp Asn Arg Tyr Tyr Pro Ala Asp Val Leu Lys 

435 440 445 



45 



Ala Asn Ala Met Ala Cys Ala lie Ala He Lys Ser Ala Asn Leu Arg 
450 455 460 
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Arg Lys Gly Ser Glu Thr Tyr Asn lie Leu Glu Ser lie 

465 470 475 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

ATGGAATTCA GACCAGTTTT AATTACAGTT CGCCGTGATC CCGGCGTAAA CACTGGTAGT 60 

TTGAAAGTGA TAGCTTATGA CTTACACTAC GACAATATAT TCGATAACTG CGCGGTAAAG 120 

TCGTTTCGAG ACACCGACAC TGGATTCACT GTTATGAAAG AATACTCGAC GAATTCAGCG 180 

TTCATACTAA GTCCTTATAA ACTGTTTTCC GCGGTCTTTA ATAAGGAAGG TGAGATGATA 24 0 

AGTAACGATG TAGGATCGAG TTTCAGGGTT TACAATATCT TTTCGCAAAT GTGTAAAGAT 300 

ATCAACGAGA TCAGCGAGAT ACAACGCGCC GGTTACCTAG AAACATATTT AGGAGACGGG 360 

CAGGCTGACA CTGATATATT TTTTGATGTC TTAACCAACA ACAAAGCAAA GGTAAGGTGG 420 

TTAGTTAATA AAGACCATAG CGCGTGGTGT GGGATATTGA ATGATTTGAA GTGGGAAGAG 4 80 

AGCAACAAGG AGAAATTTAA GGGGAGAGAC ATACTAGATA CTTACGTTTT ATCGTCTGAT 54 0 

TATCCAGGGT TTAAATGA 558 
(2) INFORMATION FOR SEQ ID NO: 18: 
<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

5 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

15 Met Glu Phe Arg Pro Val Leu He Thr Val Arg Arg Asp Pro Gly Val 

I 5 10 15 

Asn Thr Gly Ser Leu Lys Val He Ala Tyr Asp Leu His Tyr Asp Asn 
20 25 30 



20 



35 



He Phe Asp Asn Cys Ala Val Lys Ser Phe Arg Asp Thr Asp Thr Gly 
35 40 45 



Phe Thr Val Met Lys Glu Tyr Ser Thr Asn Ser Ala Phe He Leu Ser 
25 50 55 60 

Pro Tyr Lys Leu Phe Ser Ala Val Phe Asn Lys Glu Gly Glu Met He 
65 70 75 80 

30 Ser Asn Asp Val Gly Ser Ser Phe Arg Val Tyr Asn He Phe Ser Gin 

85 90 95 

Met Cys Lys Asp He Asn Glu He Ser Glu He Gin Arg Ala Gly Tyr 
100 105 HO 



Leu Glu Thr Tyr Leu Gly Asp Gly Gin Ala Asp Thr Asp He Phe Phe 
115 120 125 



Asp Val Leu Thr Asn Asn Lys Ala Lys Val Arg Trp Leu Val Asn Lys 
40 130 135 140 

Asp His Ser Ala Trp Cys Gly He Leu Asn Asp Leu Lys Trp Glu Glu 

!4S 150 155 160 
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Ser Asn Lys Glu Lya Phe Lya Gly Arg Asp lie Leu Asp Thr Tyr Val 
165 170 175 

Leu Ser Ser Asp Tyr Pro Gly Phe Lya 
180 IBS 

(2) INFORMATION FOR SEQ ID NO: 19: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATGAAGTTGC TTTCGCTCCG CTATCTTATC TTAAGGTTGT CAAAGTCGCT TAGAACGAAC 60 

GATCACTTGG TTTTAATACT TATAAAGGAG GCGCTTATAA ACTATTACAA CGCCTCTTTC 120 

ACCGATGAGG GTGCCGTATT AAGAGACTCT CGCGAAAGTA TAGAGAATTT TCTCGTAGCC 180 

AGGTGCGGTT CGCAAAATTC CTGCCGAGTC ATGAAGGCTT TGATCACTAA CACAGTCTGT 240 

AAGATGTCGA TAGAAACAGC CAGAAGTTTT ATCGGAGACT TAATACTCGT CGCCGACTCC 300 

TCTGTTTCAG CGTTGGAAGA AGCGAAATCA ATTAAAGATA ATTTCCGCTT AAGAAAAAGG 36 0 

AGAGGCAAGT ATTATTATAG TGGTGATTGT GGATCCGACG TTGCGAAAGT TAAGTATATT 420 

TTGTCTGGGG AGAATCGAGG ATTGGGGTGC GTAGATTCCT TGAAGCTAGT TTGCGTAGGT 4 80 

AGACAAGGAG GTGGAAACGT ACTACAGCAC CTACTAATCT CATCTCTGGG TTAA 534 
(2) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : not relevant 
5 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Lys Leu Leu Ser Leu Arg Tyx Leu lie Leu Arg Leu Ser Lys Ser 
15 10 15 



Leu Arg Thr Asn Asp His Leu Val Leu lie Leu lie Lys Glu Ala Leu 
20 20 25 30 

He Asn Tyr Tyr Asn Ala Ser Phe Thr Asp Glu Gly Ala Val Leu Arg 
35 40 45 

25 Asp Ser Arg Glu Ser He Glu Asn Phe Leu Val Ala Arg Cys Gly Ser 

50 55 60 

Gin Asn Ser Cys Arg Val Met Lys Ala Leu He Thr Asn Thr Val Cys 
65 70 75 80 

30 

Lys Met Ser He Glu Thr Ala Arg Ser Phe He Gly Asp Leu lie Leu 
65 90 95 

Val Ala Asp Ser Ser Val Ser Ala Leu Glu Glu Ala Lys Ser He Lys 
35 100 105 110 

Asp Asn Phe Arg Leu Arg Lys Arg Arg Gly Lys Tyr Tyr Tyr Ser Gly 
115 120 125 

40 Asp Cys Gly Ser Asp Val Ala Lys Val Lys Tyr He Leu Ser Gly Glu 

130 135 140 

Asn Arg Gly Leu Gly Cys Val Asp Ser Leu Lys Leu Val Cys Val Gly 
145 150 155 160 

45 
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Arg Gin Gly Gly Gly Asn Val Leu Gin His Leu Leu He Ser Ser Leu 
165 170 175 



Gly 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 54 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

25 

ATGGACCTAT CGTTTATTAT TGTGCAGATC CTTTCCGCCT CGTACAATAA TGACGTGACA 60 

GCACTTTACA CTTTGATTAA CGCGTATAAT AGCGTTGATG ATACGACGCG CTGGGCAGCG 120 

30 ATAAACGATC CGCAAGCTGA GGTTAACGTC GTGAAGGCTT ACGTAGCTAC TACAGCGACG 180 

ACTGAGCTGC ATAGAACAAT TCTCATTGAC AGTATAGACT CCGCCTTCGC TTATGACCAA 240 

GTGGGGTGTT TGGTGGGCAT AGCTAGAGGT TTGCTTAGAC ATTCGGAAGA TGTTCTGGAG 300 

35 

GTCATCAAGT CGATGGAGTT ATTCGAAGTG TGTCGTGGAA AGAGGGGAAG CAAAAGATAT 360 

CTTGGATACT TAAGTGATCA ATGCACTAAC AAATACATGA TGCTAACTCA GGCCGGACTG 420 

4 0 GCCGCAGTTG AAGGAGCAGA CATACTACGA ACGAATCATC TAGTCAGTGG TAATAAGTTC 4 80 

TCTCCAAATT TCGGGATCGC TAGGATGTTG CTCTTGACGC TTTGTTGCGG AGCACTATAA 54 0 

4 5 (2) INFORMATION FOR SEQ ID NO: 22: 

A 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 
(BJ TYPE: amino acid 
(C| STRANDEDNESS : not relevant 
(D) TOPOLOGY: not relevant 

{ii> MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(Xil SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Asp Leu Ser Phe lie He Val Gin He Leu Ser Ala .Ser Tyr Asn 
1.5 10 15 

Asn Asp Val Thr Ala Leu Tyr Thr Leu He Asn Ala Tyr Asn Ser Val 
20 25 30 

Asp Asp Thr Thr Arg Trp Ala Ala He Asn Asp Pro Gin Ala Glu Val 
35 40 45 

Asn Val Val Lys Ala Tyr Val Ala Thr Thr Ala Thr Thr Glu Leu His 
50 55 60 

Arg Thr He Leu He Asp Ser He Asp Ser Ala Phe Ala Tyr Asp Gin 
65 70 75 80 

Val Gly Cys Leu Val Gly He Ala Arg Gly Leu Leu Arg His Ser Glu 
85 90 95 

Asp Val Leu Glu Val He Lys Ser Met Glu Leu Phe Glu Val Cys Arg 
100 105 110 

Gly Lys Arg Gly Ser Lys Arg Tyr Leu Gly Tyr Leu Ser Asp Gin Cys 
115 120 125 

Thr Asn Lys Tyr Met Met Leu Thr Gin Ala Gly Leu Ala Ala Val Glu 
130 135 140 

Gly Ala Asp He Leu Arg Thr Asn His Leu Val Ser Gly Asn Lys Phe 
145 150 155 160 
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Ser Pro Asn Phe Gly He Ala Arg Met Leu Leu Leu Thr Leu Cys Cys 

165 170 175 

Gly Ala Leu 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: cDNA 

<iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

25 

ATGAGGCACT TAGAAAAACC CATCAGAGTA GCGGTACACT ATTGCGTCGT GCGAAGTGAC 60 
GTTTGTGACG GGTGGGATGT ATTTATAGGC GTAACGTTAA TCGGTATGTT TATTAGTTAC 120 
30 TATTTATATG CTCTAATTAG CATATGTAGA AAAGGAGAAG GTTTAACAAC CAGTAATGGG 180 

TAA 183 
(2) INFORMATION FOR SEQ ID NO: 24: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
40 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Arg His Leu Glu Lys Pro lie Arg Val Ala Val His Tyr Cys Val 
15 10 15 

Val Arg Ser Asp Val Cys Asp Gly Trp Asp Val Phe lie Gly Val Thr 
20 25 30 

Leu lie Gly Met Phe He Ser Tyr Tyr Leu Tyr Ala Leu He Ser He 
35 40 45 

Cys Arg Lys Gly Glu Gly Leu Thr Thr Ser Asn Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 

(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: raisc_f eature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /product- "Oligonucleotide" 
/notes *n is inosine at sites in this sequence." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GGNGGNGGNA CNTTYGAYGT NTCN 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
15 UGAGUGAACG CGAUG 

(2) INFORMATION FOR SEQ ID NO:27: 

<i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 

30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

3 5 ATAAGCATTC GGGATGGACC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 22 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION: /desc * "Oligonucleotide." 
(iii) HYPOTHETICAL: NO 

5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

10 ATTAACTTGA CGGATGGCAC GC 22 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 

25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

30 TACTTATCTA GAACCATGGA AGCGAGTCGA CGACTA 36 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



45 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 30: 
TCTTGAGGAT CCATGGAGAA ACATCGTCGC ATACTA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
ACTATTTCTA GAACCATGGC ATTTGAACTG AAATT 
(2) INFORMATION FOR SEQ ID NO: 32: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
TTCTGAGGAT CCATGGTATA AGCTCCCATG AATTAT 



36 
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TABL2 1 



Particle 

length. 

(nm) 



Coat protein Hr Reference 
(X10M 



1,400-2,200 
1,400-1, 800 

1,400-2,200 
1, 400-2,200 
1,400-2,200 
1,400-2,200 



39 


Gugerli 




(1984) 


26 






Gugerli 




(1984) 


43 


Zimmermann 




(1990) 


36 






Zee (1987) 


36 






Ku (1990) 


36 






Zimmermann 




(1990) 




Gugerli 




(1993) 
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TABLE 2 

Nucleotide and Deduced Amino Acid Sequences for GLRaV-3 
Coat Protein. 



ATCCCATTTCAACTCX?JV.TTACCCCXGXTATA.TGAACTCGTCCCCCAAAATAATTTCAGA 

1 • * SO 

SfAFSLiCLGQXYEVVPENNLR 

CTTAGAGTACCCGATCCGCCACAA<^AAAArrCAGTXACSCCAUl UL^AAAGTACGrT 

VRVCDAAQCKFSKASFL KYV 

AAGCACGCCACACACGCCCAATTAACCGCAAT^^ 

121 * ♦ ■ ' — « 180 

KOGTC A E LT GXAVVPEKYVF - 

CCCACACCACCTTTCCCTACACCGGOSri^^ 

11 ATAALATAAQEPPRQPP>QV 

GTGCAAC CACACGAAAC CG ATATAGGGGTACTGCCGGAATCTCAGACTCrCACACCAAAT 

241 — ♦ " * * 300 

VEPQETOICVVPESETLTPN 

AA O ' l ' llA* r r t r C GAGAAAG ATC CAG ACAA Cj I ' ll 1 iu aacactatgggcaaggg aatag ct 

301 *- - — — — ♦ ' ♦ ♦ 360 

KLVrEKDPDKFLKTM'CKGIA 

TTCCACTTGACGGCAGTTACCCACAAACCCAAAGTTATTAAC 

3 61 * « * * <20 

LDLTGVTKKPKVINEPCKVS 

GTAGACCTGCCAATCAACATTAATCCCCCATTCATGC^ 

421 ' - ► * <80 

VEVAKKINAACMELCKKVMC 

GCCGATG ACGCACC AACTAA C ACAAAATTCTTCTTGTACGTCATCCAG ATTGCTTGCACG 

AODAATKTKFFLYVHQIACT 

TTdTTACA' * 11 *C 7 T C CAC GGAGTTCAAAGACTTTCACTACATAGAAACCG ATCATGGA 

541 * * r ' + 600 , 

FFTSSSTEFKEFDTIETDDC 

AAGAACATATATCCGG * \* lU^O f ATATGATTCCATTXAACAACCi^lWl'lCCACCGCT 

KKIYAVWVYOCIKQAAASTC 

TATCAAAACCCCGTAXCCCACTATCTAGCGTACT^ 

fifil * . > « ♦ 720 

YEN PV R Q Y L A Y F T P-T F I T A T 

CTCAATCGTAAACTAGTGATGAACGAGAAGCTT^ 

721 ' ' 780 

LNCKLVUNEKVMACHCVPPK - 

Tl I » i 1 C CGTACACCATAGA C-T ' ^CU 1 CCGA CGTACGA'IC i'J. /*CAACAACGACCCA 

781 ' ■ B40 

FFP YTI OCVRPTY0LFNM OA 

^TATTAGC ATGC AATTTAG CTAGACAGCACGCGTTTACAAACAAGACCC7AACGGCCG AT 



841 



900 



ILAWN LARQQAFRNKTVTAD 

AACAC CT? A C A C AACGTCTTCCAACTATTGCAAAAG AAGTAG 
901 942 
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TABLE 3 

Comparison of Coat Protein Sequences of GLRaV-3 with pto 
CTV and LIYV. BTVf 



i so 

3YV_C? _ 

CTV_C? 

LrYV_C? 

CLRaV3_CP KAFELiCLCQI YEWPENNU* VRVGOAAQGX FSXASFLKYV KDCTQAELTC 



CONSENSUS 



BYV_CP 
CTV_CP 



51 



100 



KDDETOT.KN 

LrYV_CP KDTDCD NDVFGSCNDT RNNDDKXXEE MKQNTSDNSQ 

CLRaV3_CP IAWPEKYVF ATAAX-ATAAQ EPPRQPPAQV VcPQETDKCV VPSSETLTPN 



CONSENSU J 



"1 ISO 

BYV_CP MCSAE PISAXA. .TF ENVSX».AD.Q TCLKCZDCDK LRK N 

CTV_C? JCNXZTKEGDO WAAES..SF GSVNLKID.P TLITMNDVRQ LSTQQKAAJLN 
LIYV.C? IISTRDKEAD IICSISKEDL SKIWRVDRK DALSANDVQS FR. . .EAMIN 
CLRaV3_CP ICLVt UUJPDfC FLXTMCICCIA LDLTCVTHKP KVT..NEPGK VSVEVAMXXN 

CONSENSUS d -I £ «v -d— nd 1 w 



1S1 200 
BYV_C? FXECUCLXC. . . . VPEDNLC LALCLCLYSC AT. ICTSNKV NVQPTS.TFI 
CTV_C? RDLPLALXCrC YPNLPOKDK0 FH IAMMLYRL AV.KSSSLQS OODTT C ITYT 
LIYV_CP . ..FKRDKDP NRNQPSOKLI IAMEVCVYQK VT2JLCTSAKL C.N AWN LEFT 
GLRaV3_C? . AALMELCXK VMCA DDAATX TKFFLYVWQI ACTFFTS . . S STEFKEFOYI 



CONSENSUS C — lie — pd- 



201 

BYV_CP KASFGGGXZL YLTHGELNSF 

CTV_CP R EGVEV DLSDKLWTDI 

LXYV_CP IAYDQETRTY JCVAO. . FVNY 
CLRaV3_CP FT . . DDCXKI Y . - AVWVYDC 

CONSENSUS ±q 



250 

LCSQKLLEC3C PNKLRCFCRT FQKDYISLRX 
VYNSXGICNR TNALRVWCRT NDALYLAFCR 
KQSR-.KRNS PNWRQYARA KE2CTINNIRS 
IKQAAASTCY ENFVRQYLAY FTPTFITATL 

N — R-y-r y 



251 

BYVJCP EYRCXLPPIA 
CTV_CP QNR.NLSYGG 
LIYV_CP AGXIN.SNCV 
GUUV3_CP NGXLVWNEICV 

CONSENSUS 1 



RANRHCLPAE OHYLAADP. I 
RPLDAGIPAG YHYLCADF.L 
LAAKHCVXAS YRNSTSDFAV 
MA.QHGV7PK FFPYTIDCVR 

-a — hGvpa- y D£ — 



300 

STSTELTDLQ QSRLLLAREN 
.TCACLTDLE CAVYTQAJCEQ 
GFGNDTTDAQ LTSXMLARXQ 
PTYDLFNNDA ILAWNLARQQ 

-c led — lAr-q 



301 

BYV_CP ATH.TEFSSE 
CTV_CP LLX.KRCADE 
L I YV_C? ALC . KC EGGS 
CLRftV3_C? AF7JIXTVTAU 



328 

SPVTSLXQLC PjGLCTCR* 
WVT7JVRQLG K. FNTR* . 
VErHirraQLA MUOtPC - . 
NTLHMVFQll. QKK* 



CONSENSUS a I 
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TABLE 4 

Partial GLRaV-3 Nucleotide Sequence and Encoded Proteins 



ORFla (H2LICASE) 

GTOTC m AC^ACGCGAAGAGTGTGATGAACGACAATTTCAATATCCTTG«GACCCTGGTA 

, * + + + * + 60 

C^CAGATGAATGCGCTTCTCACACTACTTGCTGTTAAAGTTATAGGAACrCTGGGACCAT 

VSTVAKSVMNDNFNILETLV - 

ACTTTGCCCAAGTCCTTTATAGTCAAAGTACCTGGTTCGGTGCTGGTTAGCATAACCACT ^ 

6 1 TCAAACGGGCTCAGGAAATATCAGTTTCATGSACCAAGCCACGACCAATCGTAT^ 
TLPKSFIVKVPGSVLVSITT 

TCGGGCATTTCCGACAAACTTGAACTTCGGGGCGCGTTCGACGTTTCTAAAAAGAATTTC 

121 + + + * ♦ 180 

AGCCCGTAAAGGCTGTTTGAACTTGAAGCCCCGCGCAAGCTGCAAAGATTTTTCTTAAAG 
SGISDK^E LRGAFDVSKKNF - 

TCCAGGAGGTTACGTTCGAGTCCTTTGCGCGTAT^^ ^ 

181 ——————— +— — —————•*— — — — —.——+— — — — — — — - — 

AGGTCCTCCAATGCAAGCTCAGCAAACGCGCATAAAAGATCCCGATAACACCTCCTATGC 
SRRLRSSRLRVFSRAIVEDT - 

ATCAAGGTTATGAAGGGCATGAAATCAGAGGATGGTAAACCACTCCCTATAGC ^^^^ 3Q0 
I kvMKGMKSEDGKPLPIAED - 



^CGCGTTCATGACAGGCAATATGTCAAACGTTCATTGCACTAGGGCTGGTTTG 



421 



TCCGTGTACGCGTTCATGACAG<j^AAXAiTji.w*^v.« J . — -- - + ^ gQ 

AGGCA^TGCGCAAGTACTGTCCGCTATACAGTTT^ 

SVYAFMTGNMSNVHCTRAGL - 

CTCCK^CTCAAAGGCTTGCGCGGCTTCTTTAGCTG^^TC^G^^CGCT ^ 

GA^CrcCGA"GraCGA^CG^ 

LGGSKACAASLAVKGAASRA - 
ACTGGAACAAAACTCTTTTCAGGTCTCACATCCTTTCTTTCCGC ^ 

i^ScTTTTGMAA^^ 

TOTKLFSGLTSFLSAGGLFY - 
Sli[cTTCCGA*CTGCOT™ 



481 

CTACTTCCGAACTGCGGGCCTCTCTCt 

DEGLTPGERLDALTRREHAV 

:gggtcgtttccgga 

AATTCACdVi'lW-*^- 1 -**-* »nun«».v»»» — 

M -««nT. t. EPGASVAKRVV&G 



AAOTCACCTGTAGGCCTCTTAGAACCTGGAGCTTCGGTTGCGAA^ ^ 

TTAAGTGGACATCCGGAGAATCTTGGACCTCGAAC 
NSPVGLLEPGAS 

ACGAAAGCTTTTCTGTC^JSAATTGTCATTGGAGGACTTCAC ^ 

601 ic^ra^X^A^^ _ 
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AGGGTGCTTATTGGTGTTTTTACTCTTTCCATGGCTCTCACTCCGG7GGTCTGGAAGTAC 

661 ♦ * + «■ + * 720 

TCCCACGAATAACCACAAAAATGAGAAAGGTACCGAGAGTGAGGCCACCAGACCTTCATG 
RVLIGVFTLSMALTPVVWXY 

AGAAGGAATATCGCGCGAACTGGCGTGGATGTTTTCC\CCGTGCTCGTTCGGGTACCGCG 

721 -+ + + * * * 780 

TCTTCCTTATAGCGCGCTTGACCGCACCTACAAA 

RRNIARTGVDVFHRARSGTA 

GCCATCGGTTTACAATGTCTTAGTGGAGGAAGGTCGTTAGCTGGTGACGCTGCTCGTGGC 

781 + * * + + + 840 

CGGTAGCCAAATGTTACAGAATCACCTCCTTCCAGCAATCGACCACTGCGACGAGCACCG 
AIGLQCLSGGRSLAGDAA-RG 

GCGTTAACAGTGACTCGAGGAGGGCTATCTTCGGCGGTTGCGGTGACCAGAAATACAGTG 

841 + + + +■ + 900 

CGCAATTGTCACTGAGCTCCTCCCGATAGAAGCCGCCAACGCCACTGGTCTTTATGTCAC 
ALTVTRGGLSSAVAVTRNTV 

GCTAGGCGTCAGGTACCATTGGCGTTGCTTTCGTTTTCCACGTCTTACGCAGTCAGTGGT 

901 + + + + 960 

CGATCCGCAGTCCATGGTAACCGCAACGAAAGCAAAAGGTGCAGAATGCGTCAGTCACCA 
ARRQVPLALLSFSTSYAVSG 

TGCACTTTGTTAGGTATTTGGGCTCATGCTCTCCCTAGGCATTTGATGTTCTTCTTT 

961 + + + + + + 102 0 

ACGTGAAACAATCCATAAACCCGAGTACGAGAGGGATCCGTAAACTACAAGAAGAAACCG 

ctllgiwahalprhlmff'fg 

CTAGGGACGCTCTTCGGGGTGAGTGCCAGTACCAATTCTTGGTCGCTTGGGGGCTATACG 
1021 + + + + + + 1080 

gatccctgcgagaagccccactcacggtcatggttaagaaccagcgaacccccgatatgc 
lgtlfgvsastnswslggyt 

aacagtctgttcaccgtaccggaattaacttgggaagggaggagttacagatctttattg 

1081 + + * * < * 1140 

ttgtcagacaagtggcatggccttaattgaacccttccctcct 
•nslftvpeltwegrsyrsll 

cccc^u^gcagctttaggtatttctctcgttgtgcgcgggttgttaagtga 

1141 + + + + + + 1200 

ggggttcgtcgaaatccataaagagagcaacacgcgcccaacaa^ 
pqaalgislvvrgllsetvp 

caactaacgtacgtaccgccgattgaaggtcggaatctttatgatcaggc^ 

1201 + h + + + 1260 

GTTGATTGCATGCATGGCGGCTAACTTCCAGCCTTACAAATACTAGTCCGTGATTTAAAA 
Q LTYV P P I EGRNVYDQALNF 

TATCGCGACTTTGACTATGACGATGGTGCAGGCCCATCCGGGACGGCTGGTCAAAGCGAT 

1261 + + + + ► * 1320 

ATAGCGCTGAAACTGATACTGCTACCACGTCCGGGTAGGCCCTGCCGACCAGTTTCGCTA 
vp nPnYDDGAGPSGTAGQSD 
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CCTGC-AACCAATACTTCGGATACTTCTTCGGTTTTC7CTGACGATGGTTTGCCCGCTAGT 

1321 + + + 1- * 1380 

GGACCTTGGTTATGAAGCCTATGAAGAAGCCAAAACAGACTGCTACCAAACGGGCGATCA 
FGTHTSDTSSVFSDDGLPAS 

GGCGGTGGCTTCGACGCGCGCGTTGAGGCAGGTCCCAGCCATGCTGTTGATGAATCACCA 

13 81 + «■ * ^ 1440 

CCGCCACCGAAGCTGCGCGCGCAACTCCGTCCAGGGTCGGTACGACAACTACTTAGTGGT 
G G GFDA R V EAG P S HAVDE S P 

AGGGGTAGTGTTGAGTTCGTCTACAGAGAACGTGTAGATGAACATCCGGCGTGTGGTGAA 

1441 + + + + + + 1500 

TCCCCATCACAACTCAAGCAGATGTCTCTTGCACATCTACTTGTAGGCCGCACACCACTT 
RGSVEFVYRERVDEHPACGE 

GCTGAAGTTGAAAAGGATCTAATAACACCACTTGGTACAGCTGTCTT^ 

1S01 +■ * * -+ - + 1560 

CGACTTCAACTTTTCCTAGATTATTGTGGTGAACCATGTCGACAGAATCTCAGCGGGGGG 
AEVEKDLITPLGTAVLESPP 

GTAGGTCCTGAAGCTGGGAGCGCGCCCAACGTCGAGGACGGTTGTCCGGAGGTTGAAGCT 

1561 + + + h + + 1620 

CATCCAGGACTTCGACCCTCGCGCGGGTTGCAGCTCCTGCC^ 
VGPEAGSAPNVEDGCPEVEA 

GAGAAATGTTCGGAGGTCATCGTTGACGTTCCTAGTTCAGAACCGCCGGTACAAGAAGTC 

1621 + + + + + + 1680 

CTCTTTACAAGCCTCCAGTAGCAACTGCAAGGATCAAGT 

EKCSEVIVDVPSSEPPVQEV 

CTTGAATCAACCAATGGTGTCCAAGCTGCAAGAACTGAAGAGGTTGTGCAGGGCGACA^ 

1681 h + + 1- * + 1740 

GAACTTAGTTGGTTACCACAGGTTCGACGTTCTTGACTTCTC 

L E S TNG VQAARTE EVVQGDT 

TGTGGAGCTGGGGTAGCTAAATCAGAAGTGAGTCAACGTGTGTTTCCTGCGCAAGTACCC 

1741 + + + + + + 1800 

ACACCTCGACCCCATCGATTTAGTCTTGACTCAGTTGCACACAAAGGA^ 
CGAGVAKSEVSQRVFPAQVP 

GCACATGAAGCTGGTCTTGAGGCATCTAGTGGCG^ 

1801 ♦ + + -t + + 1860 

CGTGTACTTCGACCAGAACTCCGTAGATCACCGCGCCAGCACCTCGGTAACGTTCAAAGA 
AHEAGLEASSGAVVEPLQVS 

GTGCCAGTAGCCGTAGAGAAAACTGTTTTATCTGTCG^ 

1861 + + + + + 1920 

CACGGTCATCGGCATCTCTTTTGACAAAAIAGACAGCrcr^ 

V P VAV E KTVL SVE KARE LKA 

GTAGATAAGGGCAAGGCGGTCGTGCACGCAAAGGAAGTCAAGAATGTACCGGTTAAGACG 

1921 a ~ h + + + + 1980 

CATCTATTC CCGTTC C GC CAGCACGTGCGTTTCCTTCAGTTCTOACATGGCCAATTCTGC 
VDKGKAVVHAKEVKNVPVKT 
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TTACCACGAGGGGCTCTAAAAATTAGTGAGGATACCGTTCGTAAGGAATTGTGCATGTTT 

1981 + 2040 

AATGGTGCTCCCCGAGATTTTTAATCACTCCTATGGCAAGCATTCCTTAACACGTACAAA 
LPRGALK I SEDTVRKELCM c 

AGAACGTGTTCCTGCGGCGTGCAGTTGGACGTGTACAATGAAGCGACCATCGCCACTAGG 

2041 * + + + + + 2100 

TCrTGCACAAGGACGCCGCACGTCAACCTGCACATGTTACrrCGCTGGTAGCGGTGATCC 
RTCSCGVQLDVYNEATIAT R 

TTCTCAAATGCGTTTACCTTTGTCGATAGCTTGAAAGGGAGGA 

2101 + * * + + 2160 

AAGAGTTTACGCAAATGGAAACAGCTATCGAACTTTCCCTCCTCACGCCAGAAAAAGAGT 
FSNAFTFVDSLKGRSAVFF S 

AAGCTGGGTGAGGGGTATACCTATAATGGTCKjTAGCCATGTTTCATCAGGGTGGCCTCGT 

2151 * * «■ + + : * 2220 

rrCGACCCACTCCCCATATGGATATTACCACCATCGGTACAAAGTAGTCCCACCGGAGCA 
KI/GEGYTYNGGSHVSSG WPR 

GCCCTAGAGGATATCTTAACGGCAATTAAGTACCCAAGCGTCTTCGACCACTGTTTAGTG 

2221 + + + * * + 223 0 

CGGGATCTCCTATAGAATTGCCGTTAATTCATGGGTTCGCAGAAGCTGGTGACAAATCAC 
ALEDILTA IKYPSVFDHCLV 

CAGAAGTACAAGATGGGTGGAGGCGTACCATTCCACGCTGATGACGAGGAGTGCTATCCA 

2281 + + «► + * + 2340 

GTCTTCATGTTCTACCCACCTCCGCATGGTAAGGTGCGACTACTGCTCCTCACGATAGGT 
QKYKMGGGVPFHADDEECYP 

TCAGATAACCCTATCTTGACGGTCAATCTCGTGGGGAAGGCAAACTTCTCGACTAAGTTC 

2341 + + + + + f 2400 

AGTCTATTGGGATAGAACTGCCAGTTAGAGCACCCCTTCCGTTTGAAGAGCTGATTCACG 
SDNPIL TVNLVGKANFSTKC 

AGGAAGGGTGGTAAGGTCATGGTCATAAACGTAGCTTCGGGTGACTATTTTCTTATGCCT 

2401 + + + + + + 2460 

7CCITCCCACCATTCCAGTACCAGTATTTGCATCGAAGCCCACTGATAAAAGAATACGGA 
RKGGKVMV I NVASGDYFLKP 

TGCGGTTTTCAAA.GGACGCACTTGCATTCAGTAAACTCCATCGACGAAGGGCGCATCAGT 

2461 h + + + + 2520 

ACGCCAAAAGTTTCCTGCGTGAACCTAAGTCATTTGATC 

CGFQRTHL HSVNSIDEGRI S 

TTGACGTTCAGGGCAACTCGGCGCGTCTTTGGTGTAGGCAGGATCTTGC^ 

2521 m + + + + + 2580 

AACTGCAAGTCCCGTTGAGCCGCGCAGAAACCACATCCGTCCTACAACGTCAATCGGCCG 
LTFRA TRRVFGVGRM LQLAG 

GGCGTGTCGGATGAGAAGTCACCAGGTGTTCCAAACCAGCAACCACAGAGCCAAGGTGCT 

2581 + + + + + + 2640 

CCGCACAGCCTACTCTTCAGTGGTCCACAAGGTTTGGTCGTTGGTGTCTCGGTTCCACGA 
G VSDEKS PGVPNQQPQSQGA 
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ACCAGAJKAATCACACCL^AATCGG^ 270Q 

2541 T^T^^TTACTG^T^AGCCCCCCGTTCCGAGATAGACTCCCTTCACC^TCCCTT 
TR W TITPKSGGKALSEGSG5. E 

gtcaaggggaggtcgacatactcgatatggtgcgaacaagattacs^^ 27go 
2701 cag^cccctccagctgtatgagctatac^cgcttgttctaatgc^ 

VKGRSTYSXWCEQDYV RKCE - 
TGGCTCAGGGCTGATAATCCAGTGATGGCTCTTRAACCT^ 282Q 

27 61 IcCGAGTCCCGACTACTAGG^CTACCGAGAAVTT^ 

W LS.ADNPVMAI1? PGVTPMTF - 

gaagtggttaaawc^acctctc^gatgccgtcgtgg^ac^aagtatc^ct 2aao 

2821 crcCACCAAraCGGCCC^GAC 

evvkagtsedavveylkyla - 

ATAGGCATTGGGAGGACATACAGGGCGTTGCrc^^ 2g4Q 

tatccgtaIcc^ctcta^cccgcaacgaataccgatctttataacgg^ . 
igigrtyrallmarniavtt - 

2941 CGGCrcccACA^GICmcl^ 

aegvlkvpnqvyeslpgfhv - 

TACAACTCGGGCACAGATCTCATTTTTCATTC^^ ^ 

3001 Z^i^S^M^^^ 

YKSGTDLIFHSTQDGLRVRD 



2881 



CTACCGTACGTATTCATAGCTGAGAAAGGTATTTTOATC^GGGCAAAGATGTC ^ 

gmg^g^aIctmcgIc^^ 

LPYVFIAEKGIFIKGKDVDA - 

gtagtagctttgggcgacaatctctccgtatgtgatgatat^ 3iao 

^t^tcgaaIccot^^IgI^^^cactactatataaccaaaaggtactacga 

vvalgdnlsvcddilvfhda - 

attaatttgatcggtgcactgaaagttgctcgatctggtatggtgg 324q 

^AA^A^C^CcIotS^ 

inlmg alkvarcgmvgesfk - 

TCGTTCGAATACAAATGCTATAATGCTCCC^CAGGTGGCGGTAAGA 33oq 

3241 IcCA^ra^AC^A^G^^ 

S FEYKCYNAP pGGGKTTMLv - 



2061 



3121 



3181 
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GACGAATTTGTCAAGTCACCCAATAGCACGGCC^CCATTACGGCTAACGTGGGJ^GTTCT 

3301 * 3360 

CTGC7TAAACAGTTCAGTGGGTTATCGTGCCGGTGGTAATGCCGATTGCACCCTTCAAGA 
DZFVXSPNSTATXTANVGSS 

GAGGACATAAATATGGCGGTGAAGAAGAGAGATCCGAATTTGGAAGG7CTCAACAGTGCT 
3351 . + ^ 3420 

ctcctgtatttataccgccacttcttctctctaggcttaaaccttccagagttgtcac^ 
e~dinmavkkrdpnleglnsa 

accacagttaactcc^gggtggttaactttattgtca^ 

3 421 + + * +- * * 3480 

TGGTGTCAATTGAGGTCCCACCAATTGPJVATA^ 

TTVNSRVVNFXVRGMYKRVIi 

GTGGATGAGGTGTACATGATGCATCAAGGCTTACTACAACTAGGCGTCTTCGCAACCGGC 

3481 + + * + + * 3540 

CACCTACTCCACATGTACTACGTAGTTCCGAATGATGTTGATCCGCAGAAGCGTTGGCCG 

VD'EVYMMH QGL LQLGVFATG 

GCGTCGGAAGGCCTCTTTTTTGGAGACATAAATCAGATACCATTCATAAACMGGGAGAAG 

3541 + + - + + * 3 600 

CGCAGCCTTCCGGAGAAAAAACCTCTGTATTTAGTCTATGGTAAGTATTTGKCCCTCTTC 
ASEGLFFGDINQI PFINREK 

GTGTTTAGGATGGATTGTGCTGTATTTGTTCCAAAGAACT 
3601 + + + + H * 3660 



VFRMDCAVFVP KKESVVYTS 

AAATC^TACAGGTGTCCGTTAGATGTTTGCTACTTGTTGTCCTC 

3661 + + * * * * 3720 

TTTAGTATGTCCACAGGCAATCTACAAACGATGAACAACAGGAGTTACTGGCATTCCCCT 

KSYRCP LDVCYLLS SMTVRG 

ACGGAAAAGTGTTACCCTGAAAAGGTCG7TAGCGGTAAGGACAAACCAGTAGTAAGATCG 

3721 + * + * * 3780 

TGCCTTTTCACAATGGGACTTTTCCAGCAATCGCCATTCCTGTTTGGTCA 

TEKCYP EKVVSGKDKPVVRS 

CTGTCCAAAAGGCCAATTGGAACCACTGATGACGTAGCTGAAATAAACGCTGACGTGTAC 

3781 + * * + 3840 

GACAGGTTTTCCGGTTAACCTTGGTC^CTACT 

LSKRPIGTTDDVAEINADVY - 

TTGTGCATGACCCAGTTGGAGAAGTCGGATATGAAGAGGTCGTTGAAGGGAAAAGGAA^ 

3841 * - + * + + 3900 

AACACGTACTGGGTCAACCTCTTCA^ 

LCMTQLEKSDMKRSLKGKGK 

GAAACACCAGTGATGACAGTGCATGAAGCACAGGGAAAAACATTC 

3901 + + * * + * 3960 

CTTrcTCGTCACTACTCTCAC^ 

ETPVMTVHEAQGKTFSDVVL 
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^TAGG^ CG AAGAAAGC Z S ATGACTC CCTATTC ACTAAACAACCGCATATACTTGTTCG T 

3961 11 * + + 4020 

AAATCCTGCTTCTTTCGGCTACrGAGGGATAAGTGATTTGTTGGCGTATATGAACAACCA 
FRTKKADDSLFTXQPHILVG 

TTGTCGAGACACACACGCTCACTGGTTTATGCCGCTCTGAGCTCAGAGTTGGACGATAAG 

4021 - -r + + * 4060 

" AACAGCTCTGTGTGTGCGAGTGACCAAATACGGCGAGACTCGAGTCTCAACCTGCTATTC 
LSRHTRSLVYAALSSELDDK 

(FRAME SHIFT) 

GTCGGCACATATATTAGCGACGCGTCGCCTCAATCAGTATCCGACGCTTTGCTT2ACACG 

4081 * * + + + + 4140 

CAGCCGTGTATATAATCGCTGCGCAGCGGAGTTAGTCATAGGCTGCGAAACGAAGTGTGC 

VGTYISDA S pQSVSDALL/fT 

SAHILATRHLNQYPTLCFTR - 

ORFlfc (RdSp) 

TTCGCCCCGGCTGGTTGCTTTCGAGGTATATGAGCGTATGAATTTTGGACCGACCTTCGA 

4141 + + + * + 4200 

AAGCGGGGCCGACCAACGAAAGCTCCATATACTCGCATACTTAAAACCTGGCTGGAAGCT 

FAPACCFJ?GI* 
SPRLVAFEVYERMNFGPTFE - 

AGGGGAGTTGGTACGGAAGATACCAACAAGTCATTTTGTAGCCGTGAATGGGTTTCT 

4201 <■ + + + + 4260 

TC CCCTCAAC CATGCCTTCTATGGTTGTTCAGTAAAACATCGGCACTTACCCAAAGAGCT 
GELVRKI pTSHFVAVNGFLE - 

GGACTTACTCGACGGTTGTCCGGCTTTCGACTATGACTTCTTTGAGG 

4261 * + * + + 4320 

CCTGAATGAGCTGCCAACAGGCCGAAAGCTGATACTCAAGAAACTCCTACTAAAGCTTTG 

DLLDGCPAFDYDFFEDDFET - 

TTCAGATCAGTCTTTCCTCATAGAAGATGTGCGCATTTCTGAATC^ 

4321 - * + + + * 4380 

AAGTCTAGTCAGAAAGGAGTATCTTCTACACGCGTAAAGACTTAGAAAAAGAGTAAAACG 

SDQSFLIEDVRISESFSHFA - 

GTCGAAAATAGAGGATAGGTTTTACAGTTTTATTAGGTCTAGCGTAGGTTTACCAAAGC^ 

43 81 + < + * * 4440 

CAGCTTTTATCTCCTATCCAAAATGTCAAAATAATCCAGATCGCATCCAAATGGTTTCGC 

SKIEDRFYSFIRSSVGLPKR - 

CAACACCTTGAAGTGTAACCTCGTCACGCTTGAAAATAGGAATTC 

4441 + + * +: * * 4500 

GTTGTGGAACTTCACATTGGAGCAGTGCAAACTTTTATCCTTAAGGTTGCGGCT 
NTLKCNLVTFENRNSNADRG - 

T^TAACGTGGGTTGTGACGACTCTGTGGCGCATGAAC^ 

4S01 * + + + 4560 

AACATTGCACCCAACACTGCTGAGACACCGCGTACTTGACTTCCTCTAAAA 

CNVGCDDSVAHELKEIFFEE- 
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4561 



4622 



GGTCGTTAACAAAGCTCGTTTAGCAGAGGTGACGGA^^GCCATTTGTCCAGCAJkCACGAT 

+ * * + + 4620 

CCAGCAATTGTTTCGAGCAAATCGTCTCCACTGCCTTTCGGTAAACAGGTCGTTGTGC^A 
VVNKARLAEVTESH LSSNTM- 

GTTGTTATCAGATTGGTTGGACAAAAGGGCACCTAACGCTTACAAGTCTCTCAAGCGGGC 

+ + + * + + 4$fl0 

CAACAATAGTCTAACCAACCTGTTITCCCGTGGATTGCGAATGTTCAGAGAGTTCGCCCG 
LLSDWLDKRAPNAYKSLKRA- 



4681 



4741 



^ + + + + 474Q 

AAATCCAAGCCAACAGAAAGTAGGCAGATACAACTGCAGAATATGCGAGTACCACTTTCG 
LGS VVFHP SMLTS VTLMVKA 

AGACGTAAAACCCAAGTTGGACAATACGCCATTGTCGAAGTACGTAACGGGGCAGAATAT 

+ + + + + + 480Q 

TCTGCATTTTGGGTTCAACCTGTTATCCGGTAACAGCTTCATGCATTGCCCCGTCTTA1A 
DVKPKLDNT PLSKYVTGQNI - 



4801 



\TTTTTACTGCGTGCGTAGA 



TCAGATGGTGCTATCCACGCATTGACGCGAAAAAAGAACGTAAAAATGACGCACGCATCT 
VYHDRCVTALFSC I F T A C V E - 



4860 



4861 



GCGCTTAAAATACGTAGTGGACGAAAGGTGGCTCTTCTACCACGGGATGGAC\CTGCGGA 



CGCGAATTTTATGCATCACCTGCTTTCCACCGAGAAGATGGTGCCCTACCTGTGACGCCT 
RLKYVVDERWLFYHGMDTAE 



-+ 4920 



4921 



GTTGGCGGCTGCATTGAGGAACAATTTGGGGGACATCCGGCAATACTACACCTATGAACT 



CAACCGCCGACGTAACTCCTTGTTAAACCCCCTGTAGGCCGTTATGATGTGGATACTTGA 
LAAALRNNLGDIRQYYTYEL- 



4980 



GGATATCAGTAAGTACGACAAATCTCAGAGTGCTCTCATGAAGCAGGTGGAGGAGTTGAT 

4981 «. ** + + + + 5040 

CCTATAGTCATTCATGCTGTTTAGAGTCTCACGAGAGTACTTCGTCCACCTCCTCAACTA 
DISKYDKSQSALMKQVEEL I 

ACTCTTGACACTTGGTGTTGATAGAGAAGTTTTGTCTACTTTC 

5041 + + h —h + + 5100 

TGAGAACTGTGAACCACAACTATCTCTTCAAAACZAGATGAAAGAAAACAC 
LLTLGVDREVLSTFFCGEYD- 

TAGCGTCGTGAGAACGATGACGAAGGAATTGGTGTTGTCTGTCGGCTC 
5101 k * + + h + 5160 

ATCGCAGCACTCTTGCTACTGCTTCCTTAACCACAACAGACAGCCGAGAGTCTCCGCGTC 
SVVRTMTKELVLSVGSQRRS - 



TGGTGGTGCTAACACGTGGTTGGGAAATAGTTTAGTCTT 

5161 + + + ^ + + 5220 

ACCACCACGACTGTGCACCAACCCTTTATCAAATCAGAACACGTGGA 
GGANTWLGNS LVL C T L L S. VV - 
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ACTTAGGGGATTAGATTATAGTTATATTGTAGTTAGCGGTGATGATAGCCTTATATTTAG 

5221 + + * 5280 

TGAATCCCCTAATCTAATATCAATATAACATCAATCGCCACTACTATCGGAATA.TAAATC 
LRGLDYSYIVVSGDDSLIFS - 

TCGGCAGCCGTTGGATATTGATACGTCGGTTCTGAGCGATAATTTTGGTTTTGACGTAAA 

5281 + + + + 5340 

AGCCGTCGGCAACCTATAACTATGCAGCCAAGACTCGCTATTAAAACCAAAACTGCATTT 
RQPLDIDTSVLSDNFGFDVK- 

GATTTTTAACCAAGCTGCTCCATATTTTTGTTCTAAGTTTTTAGTTCAAGTCGAGGATAG 

5341 + + + + + + 5400 

CTAAAAATTGGTTCGACGAGGTATAAAAACAAGATTCAAAAATCAAGI^ 
IFNQAAPYFCSKFLVQVEDS - 

TCTCTTTTTTGTTCCCGATCCACTTAAACTCTTCCT^ 

5401 + + + + + + 5460 

AGAGAAAAAACAAGGGCTAGGTGAATTTGAGAAGCXATTCAAACCTC 
LFFVPDPLKLFVKFGASKTS - 

AGATATCGACCTTTTACATGAGATTTTTCAATCTTTCGTCGATCTTTCGAAGGGTTTCAA 

5461 -4- + * + + 5520 

TCTATAGCTGGAAAATGTACTCTAAAAAGTTAGAAAGCAGCTAGAAAGCTTCCCAAAGTT 
DIDLLHEIFQSFVDLSKGFN- 

TAGAGAGGACGTCATCCAGGAATTAGCTAAGCTGGTGACGCGGAAATATAAGCATTCGGG 

5521 + + * + + + 5580 

ATCTCTCCTGCAGTAGGTCCTTAATCGATTCGACCACTGCGCCTTTATATTCGTAAGCCC 
REDVIQELAKLVTRKYKHSG - 

ATGGACCTACTCGGCTTTGTGTGTCTTGCACGTTTXAAGTGCAAATTTTT 

5581 + + + + + + 5640 

TACCTGGATGAGCCGAAACACACAGAACGTGCAAAATTCACGTTTAAAAAGCGTCAAGAC 
WTYSALCVLHVLSANFSQFC- 

TAGGTTATATTACCACAATAGCGTGAATCTCGATGTGCGCCCTATTCAGAGGACCGAGTC 

5641 + + + + + + 5700 

ATCCAATATAATGGTGTTATCGCACTTAGAGCTACACGCGGGATAAGTCTCCTGGCTCAG 
RXiYYHNSVNLDVRPIQRTES - 

GCTTTCCTTGCTGGCCTTGAAGGCAAGAATTTTAAGGT 

5701 — + + + + + 5760 

CGAAAGGAACGACCGGAACTTCCGTTCTTAAAATTCCACCTTTCGAAGAGCAAAACGGAA 
L S LLALKARI LRWKASRFAF - 

TTCGATAAAGAGGGGTTAATCGCGTTGGCCACGCTATAGTGTTTCTGTGCCTCGGTTC 

5761 : +- + + + + + 5820 

AAGCTATTTCTCCCCAATTAGCGCAACCGGTGCGATATCACAAAGAC^ 
S I K R G * 

CGTGAGGTTAATACCGAAGGGTCGTCGTACTTATCTCAGT^^ 

5821 + + + + + 5880 

GCACTCCAATTATGGCTTCCCAGCAGCATGAATAGAGTCAATAAATA 

CTTAGGCGTGCCATCCGTGAAGTTAATACCGGTGGCACTCCTTCTCGAAGTGGGTATTAA 

5881 + + + + + + 5940 

GAATCCGCACGGTAGGCACTTCAATTATGGCCACCGTGAGGAAGAGCTTCACCCATAATT 
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AGACCAAAATTTTTTATTTGTSTGTACTTT^ 

5341 * + + + 6000 

TCTGGTTTTAAAAAATAAJ^CXCACATGAAA.^^^ 

0RT2 <7X) 

GGTGGAACATGTAC\GTAGAGGGTCTTTCTTTAAGT^ 

6001 + + * + 6060 

CCACCTTGTACATGTCATCTCCCAGAAAGAAATTCAGAGCCCAATGGGAAGGATGAGAAC 

MYSRGS FFKSRVTLPTLV- 

TCGGAGCATACATGTGGGAGTTTGAACTCCCGTATCTTACGGACAAGAGAC^ 

60 61 + + + + + + 6120 

AGCCTCGTATGTACACCCTCXAACTTGAGGGCATAGAATGCCTGTTCTCTGTGTAGTCGA. 
GAYMWE FE L P YLTD KRHISY- 

ATAGCGCGCCAAGTGTCGCGACTTTTAGCCTTGTGTCGAGGTAGGATAGGGGCCAACAGG 

6121 + ♦ * +- «■ 6180 

TATCGCGCGGTTCACAGCGCTGAAAATCGGAACACAGCTCCATCCTATCCCCGGTTGTCC 
S APSVATFSLVSR* 

TGACCAACAGCCTGCACTTAAGGTGCGCTGGAAGTGTTGGATTTGGTCTCAGTGTGCCAA 

618-1 + + + * + + 6240 

ACTGGTTGTCGGACGTGAATTCCACGCGACCTTCACAACCTAAACCAGAGTCACACGGTT 

ATATCCTTTTAGGCGATGTACAGGAGTCTAGTTTAGTGTGTCTTTGGGGGATGACGGGAG 

6241 h + + + + 6300 

TATAGGAAAATCCGCTACATGTCCTCAGATCAAATCACACAGAAACCCCCTACTGCCCTC 

CGACTAGGTTTAGGACTGTAGCTGCTATGTAAGTCGTGCATGCGGCATTGTGCGTAAGAC 

6301 + + + + + + 63 60 

GCTGATCCAAATCCTGACATCGACGATACATTCAGCACGTACGCCGTAACACGCATTCTG 

GTGCATGCATTTGGGCGAGTGCCCTAGGGCAGCGTCGGTCAGGTGACTAGCAGCCGGCTC 

6361 + + + + + * 6420 

CACGTACGTAAACCCGCTCACGGGATCCCGTCGCAGCCAGTCCACTGATCGTCGGCCGAG 

TACGGAGCGCTGAAAGTGCTAGGTCCTGAAGGTAC^GTTGGGCTGAGG 

6421 + + + + + * 6480 

ATGCCTCGCGACTTTCACGATCCAGGACTTCCATGTCAACCCGACTCCGTCCTGTACCAA 

GAACGAGTTGACCGTGGGGACCAGCGGCGGTGACTCGGGCCGTAGCCACGCGCGGGGCGG 

6481 + + + •* + 6540 

CTTOCTCAACTGGCACCCCTGCTOGCCGCCACTGaGCCCGGCATC^ 

CAGGGCGTCTCGTGGTGTATCTGGGCAAGATACGGCTTTATT^ 

6541 + + + + + + 6600 

GTCCCGCAGAGCACCACATAGACCCGTTCTATGCCGAAATAATCCGTGGTATTATACCTC 

CCCAAAGCGTCGGGGTCGGGAAACATCTCCATAGCTTAGTGGCAGCAGCCT^ 

6601 + + + + * 6660 

GGGTTTCGCAGCCCCAGCCCTTTGTAGAGGTATCGAATCACCGTCGTCGGATTCTATCCG 

TGGGAGGCCCGTTCCCTGTAGTAGTGGTGGGTTAGCATGCCACTAAGCGGTGCGGCGTGA 

6661 - + + " * S720 

ACCCTCCGGGCAAGGGACATCXTCACCACCCAATCGTACGGTGATTCGCCACGCCGCACT 
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TAAGGCGCCACCGTCCGTAGTTAGGCGACCCGTGTTTTAATAGGGTCTCTTTAGTTAAGT 
ATT C C 3CGGTGGCAGGCATC AAT CC GCTGGGCACAAAATTATCC CAGAGAAATCAATTC A 
TTAGGCATGTCGTACAGTTAGGATTTCrTTTT^^ 

_ ^ + + +. + 

AATCCGTACAGCATGTCAATCCTAAAGAAAAATCTATAAGAAAATAAAAAATAACAAAC^ 



TAGTTTAGATGTACATTATTACGTAGGTTACTTTGGCGCTACGCCAGAGGTTTTTCCTCT 

6841 + * + + — 6900 

ATCAAATCTACATGTAATAATGCATCCAATGAAACCGCGATGCGGTCTCCAAAAAGGAGA 



TTGTGTGTAGCCTTTAATGTAGGTTTCTTTGTTTTATTTTT 
+ - ^ — +- — 

AACACACATCGGAAATTACATCCAAAGAAACAAAATAAAAACGGAAAGTCCGCCGCCX^ 
TCTTTTCTTCTATTTAGGTTTATCTTCTTTCCTTAGTG 

AGAAAAGAAGATAAATCCAAATAGAAGAAAGGAATCACAACAGCATATACTGCGATGCAG 
CAAATTATGAATTTTCCTTCGTGTAGGCGTCGTTGAGTGCGTTCATCGGCGCTAGACGAG 

^ + > + — — -4- 

GTTTAATACTTAAAAGGAAGCACATCCGCAGCAACTCACGCAAGTAGCCGCGATCTGCTC 
GTTTAGTGGCGACATAAATAGGTTTTTGCGCGAGAT^^ 

+ + + + — f + 

CAAATCACCGCTGTATTTATCCAAAAACGCGCTCTAACCCTATCTTGCTCAAGCGGAATT 

AAGAGAAATCGGGGAA.GGCGCCACGCGAATGACCTTCGTGCTGAGCGAAGGTAGTATCGT 

TTCTCTTTAGCCCCTTCCGCGGTGCGCTTACTGGAAGCACGACTCGCTTCCATCATAGCA 



ORF3 ( 5K , Mombxan« p ro t a in ) 

GATTTTATATTGAAGT AGGC GT ATTTGTTT ATGG ATG ATTTT AAACAGGCAATACT GTTG 

7201 - — * * + * 7260 

CTAAAATATAACTTCATCCGCATAAACAAATACCTACTAAAATTTGTCCGTTATGACAAC 

MDDFKQAILL 

CTAGTAGTCGATTTTGTCTTCGTGATAATTCTGCTGCTGGTTCTT 

7261 ► + + * + + 7320 

GATCATCAGCTAAAACAGAAGCACTATTAAGACGACGACCAAGAATGCAAGCAGCAGGGC 

LVVDFVFVIILLLVLTFVVP 

AGGTTACAGCAAAGCTCCACCATTAATACAGGTCTTAGGACAGT^ 

7321 h < * + + + 7380 

TCCAATGTCGTTTCGAGGTGGTAATTATGTCCJIGAATCCTGTCACACTAAGGAGGA 
RLQQSSTINTGLRTV* 

0R74 (HSP70 Homolog) 

TTAGATATGGAAGTAGGTATAGATTTTGGAACCACTTTCAGCACAATCTGCTTTTCCC^ 

73 81 + + * * — * 7440 

AATCTATACCTTCATCCATATCTAAAACCTTGGTGAAAGTCGTGTTAGACGAAAAGGGGT 

MEVGIDFGTTFSTICFSP 
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TCTGGGGTCAGCGGTTGTACTCCTGTGGCC3GTAGTGTTTACGTTGAAACCCAAATTTT7 

7441 * * + + * ♦ 7500 

AGACCCCAGTCGCCAACATGAGGACACCGGCCA7CACAAATGCAACTTTGGGTTTAAAAA 
SGVSGCTPVAGSVYVSTQIF 

ATACCTGAAGGTAGa^TACTTACTTAATTGGTAAAGCTGCGGGGAAAGCTTATCGTGAC 

7501 + * * + - + 7560 

TATGGACTTCCATCGTCATGAATGAATTAACCATTTCGACGCCCCTTTCGAATAGCACTG 
IPEGSSTYLIGKAAGKAYRD 

GGTGTAGAGGGAAGGTTGTATGTTAACCCGAAAAGGTGGGCAGGTGTGACGAGGGATAAC 

7561 + * + + + + 7620 

CCACATCTCCCTTCCAACATACAATTGGGCTTTTC 

GVEGRLYVNPKRWAGVTRDN 

GTCGAACGCTACGTCGAGAAATTAAAACCTACATACACCGTGAAGATAGACAGCGGAGGC 

7621 + + + + + + 7680 

CAGCTTGCGATGCAGCTCTTTAATTTTGGATGTATC 

V E. RYVEKLKPTYTVKIDSGG 

GCCTTATTAATTGGAGGTTTAGGTTCCGGACCAGACACCTTATTGAGGGTCGTT 

7681 + + + + + + 7740 

CGGAATAATTAACCTCCAAATCCAAGGCCTGGTCTGTGGAATAACTCCCAGCAACTGCAT 
ALLIGGLGSGPDTLLRVVDV 

ATATGTTTATTCTTGAGAGCCTTGATACTGGAGTGCGAAAGGTATACGTCTACGACGGTT 

7741 + + + + + + 7800 

TATACAAATAAGAACTCTCGGAACTATGACCTCACGCTTTCCATATGCAGATGCTGCCAA 
ICLFLRALILECERYTSTTV 

ACAGCAGCTGTTGTAACGGTACCGGCTGACTATAACTC 

7801 + + + + + 7860 

TGTCGTCGACAACATTGCCATGGCCGACTGATATT^ 

TAAVVTVFADYNSFKRSFVV 

GAGGCGCTAAAAGGTCTTGGTATACCGGTTAGAGGTGTTGTTAACGAACCGACGGCC 

7861 + + + + + + 7920 

CTCCGCGATTTTCCAGAACCATATGGCCAATCTCCACAACAATTGCTTGGCTC 
EALKGLGI PVRGVVNEPTAA 

GCCCTCTATTCCTTAGCTAAGTCGCGAGTAGAAGACCTATTATTAGCGGTTTTTGA 

7921 + + + + + + 79 80 

CGGG AGATAAGGAATCGATTCAGCGCTCATCTTCrGGATAATAATC^ CAAAAACTAAAA 
ALYSLAKSRVEDLLLAVFDF 

GGGGGAGGGACTTTCGAOTTCTCATTCGTTAAGAAGAAGGGAAATATACT 

7981 + + + + + 8040 

CCCCCTCCCTGAAAGCTGCAGAGTAAGCAATTCTTCTTCCCT^ 
GGGTFDVSFVKKKGNILCVI 

TTTTCAGTGGGTGATAATTTCTTGGGTGGTAGAGATATTC 

8041 + «- + -t + + 8100 

AAAAGTCACCCACTATTAAAGAACCCACCATCTCTATAACTATCTTCGATAGCACCTTCAA 
F SVGDN FLGGRDIDRAIVEV 
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ATCAAACAAAAGATCAAAGGAAAGGCGTCTGATGCCAAGTTAGGGATATTCGTATCCTCG 

8101 + * + * 81^C 

TAGTTTGTTTTCTAGTTTC CTTTCCGCAGACTACGGTTCAATC C CTATAAGCATAGGAG C 

ik"qkikgkasdaklgifvss 
atgaaggaagacttgtctaacaataacgctataacgcaacaccrtatccccgtagaaggg 

6161 + + + + * 8220 

TACTTCCTTCTGAACAGATTGTTATTGCGATATTGCGTTGTGGAATAGGGGCATCTTCCC 
MKEDLSNNNAITQHL IPVEG 

GGTGTGGAGGTTGTGGATTTGACTAGCGACGAACTGGACGCAATCGTTGCACCATTCAGC 

8221 + + + + + + 8280 

CCACACCTCCAACACCTAAACTGATCGCTGCTTGACCTGCGTTAGCAACGTGGTAAGTCG 
GVEVVDLTSDELDAIVAPFS 

GCTAGGGCTGTGGAAGTATTCAAAACTGGTCTTGACAACTCT 

8281 " + -h— + + 8340 

CGATCCCGACACCTTCATAAGTTTTGACCAGAACTGTTGAAAATGGGTCTGGGCCAATAA 
ARAVEVFKTGLDNFYPDPV'I 

GCCGTTATGACTGGGGGGTCAAGTGCTCTAGTTAAGGTCAGGAGTGATGTGGCTAATTTG 

83 41 + + + + * + 8400 

CGGCAATACTGACCCCCCAGTTCACGAGATCAATTCCAGTC 

AVMTGGSSALVKVRSDVANL 

CCGCAGATATCTAAAGTCGTGTTCGACAGTACCGATTTTAGAT 

8401 ♦ + * * + * 84^0 

GGCGTCTATAGATTTCAGCACAAGCTGTCATGGCTAAAATCTACAAGCCACCGAACACCC 
PQISKVVFDSTDFRCSVACG 

GCTAAGGTTTACTG C GATACTTTGGCAGGTAATAGCGGACTGAG ACTGGTGGACACTTTA 

8461 ♦ * * + ♦ * 8520 

CGATTCCAAATGACGCTATGAAACCGTCCATTATCGCCTGACTCTGACCACCTGTGAAAT 

AKVYCDTLAGNSGLRLVDTL 

ACGAATACGCTAACGGACGAGGTAGTGGGTCTTCAGCCGGTGGTAATTTTC 

8521 * " ♦ * * + 8580 

TGCTTATGCGATTGCCTGCTCCATCACCCAGAAGTCGGC 

TN TLTDEVVGLQPVVIFPKG 

AGTCCAATACCCTGTTCATATACTCATAGATACACAGTGGGTGGTGGAGATGTGGTATAC 

8S81 + h a + + 3640 

TCAGGTTATGGGACAAGTATATGAGTATCTATGTGTCACCCACCACCTCTACACCATATG 
SPIPCSYTHRYTVGGGDVVY 

GG TATATTTGAAGGGGAGAATAACAGAGCTTTTCTAAAT 

8641 + + + < + + 8700 

CCATATAAACTTCCCCTCTTATTGTCTCGAAAAGATTTACTCGGCTGCAAGGCCCCGCAT 

GX FEGENNRAFLNE PTFRGV 

TCGAAACGTAGGGGAGACCCAGTAGAGACCGACGTGGCGCAGTTTAATCTCTCCACGGAC 

8701 1- + + * + + 8760 

AGCTTTGCATCCCCTCTGGGTCATCTCTGGCTGCACCGCGTCAAATTAGAGAGGTGC 

SKRRGDPVETDVAQ FNLSTD 
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GGAACGGTGTCTGTTATCGTTAATGGTGAGGAAGTAAAGAATGAATATCTGGTACCCGGG 
3751 -r * + + + ag20 

CCTTGCCACAGACAATAGCAATTACCACTCCTTCATTTCTTACTTATAGACCATGGGCCC 
GTVSVIVNGEEVKNEYLVPG 

ACAACA^CGTACTGGATTCATTGGTCTATAAATCTGGGAGAGAAGATTTAGAGGCTAAG 
8821 * + + + + 888c 

TGTTGTTTGCATGACCTAAGTAACCAGATATTTAGACCCTCTCTTCTAAATCTCCGATTC 
TTNVLDSLVYKSGREDLEAK 

GCAATACCAGAGTACTTGACCAC\CTGAATATTT^ 
8881 + + + + + + 8g40 

CGTTATGGTCTCATGAACTGGTGTGACTTATAAAACGTGCTATTCCGAAAGTGCTCCTCT 
AIPEYLTTLNILHDKAFTRR*. 

AACCTGGGTAACAAAGATAAGGGGTTCTCGGATT^^ 

8941 + + + + + + 9000 

TTGGACCCATTGTTTCTATTCCCCAAGAGCCTAAATTCCTArCTTCTTTTAAA 
NLGKKDKGFSDLRIEEN FLK 

ORT5 (E5790 Hosxolog) 

TCCGCCGTAGATACAGACACGATTTTGAATGGATAAATATATTTATGTAACGGGGATATT 
9001 + * + + + + 906O 

AGGCGGCATCTATGTCTGTGCTAAAACTTACCTATTTATATAAATACATT.GCCCCTATAA 
SAVDTDTI LNG * 

MDKYIYVTG I L - 

AAACCCTAACGAGGCTAGAGACGAGGTATTCTCGG7AGTGAATAAGGGATATATTGGACC 

9061 + + + + + -k 9120 

TTTGGGATTGCTCCGATCTCTGCTCCATAAGAGCCATCA 
NPNEARDEVFSVVNKGYIGP - 

GGGAGGGCGCTCCTTTTCGAATCGTGGTAGTAAGTACACCGTCGTCTGGGAAAACTCTGC 

9121 + + + + + + 9180 

CCCTCCCGCGAGGAAAAGCTTAGCACCATCATTCATGTGGCAGCAGACCCTTTTGAGACG 
GGRSFSNRGSKYTVVWENSA 

TGCGAGGATTAGTGGATTTACGTCGACTTCGCAATCTACGATAGATGCTTTCGCGTATTT 
9181 + + + h + + 9240 

acgctcctaatcacctaaatgcagctgaagcgttagatgctatctacgaaagcgcataaa 
ari sgftstsqstidafayf 

cttgttgaaaggcgc^ttgactaccacgctctct^ 

9241 -h + + + + «. 9300 

GAACAACTTTCCGC CTAACTGATGGTGCGAGAGATTGGGTTATTTGACACTCTTAACC 
LI.KGGLTTTLSNPINCENWV- 

CAGGTCATCTAAGGATTTAAGCGCGTTTTTCAGGACCCTAATTAAAGCT 

9301 + + + * + + 93 60 

GTCCAGTAGATTCCTAAATTCGCGCAAAAAGTCCTGGGAT7AATTTCCATTCTAAATACG 
RSSKDLSAFFRTLIKGKIYA- 

ATCGCGTTCTGTGGACAGCAATCTTCCAAAGAAAGACAGGGATGACATCATGGAAGCGAG 

9361 * h + + + * 9420 

TAGCGCAAGACACCTGTCGTTAGAAG G lTrCTTT C TGTCCCTACTGTAGTACCTTCGCTC 
SRSVDSNLPKKDRDDIMEAS- 
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TCGACGACTATCGCC^TCGGACGCCGCCTTTTGCAGAGCAGTGTCGGTTCAGGTAGGGAA 

9422. + «■ ^ * * 9480 

AGCTGCTGATAGCGGTAGCCTGCGGCGGAAAACGTCTCGTCACAGCCAAGTCCATCCCTT 
) RRLSPSDAAFCRAVSVQVGK- 

• GTATGTGGACGTAACGCAGAATTTAGAAAGTACGATCGTGCCGTTAAGAGTTATGGAAAT 

9481 * * + + + ^ 9540 

C\TACACCTGCATTGCGTCTTAAATCTTTCATGCTAGCACGGCAATTC7CAATACCTTTA 
YVDVTQNL EST IVP LRVME I - 

AAAGAAAAGACGAGGATGAGCACATGTTAGTTTAC 
9541 + + + * * + 9600 

KKRRGSAHVSL PKVVSAYVD 

TTTTTATACGAACTTGCAGGAATTGCTGTCGGATGAAGTAACTAGGGCCAGAACCGATAC 

9601 + + + + * + 9660 

AAAAATATGCTTGAACGTCCTTAACGACAGCCTACTTCATTGATCCCGGTCTTGGCTATG 
FYTNLQELLrSDEVTKARTDT - 

AGTTTCGGCATACGCTACCGACTCTATGGCTTTCTTAGTTAAGATGTTACCCCTGACTGC 

9^61 + + + ► «■ + 9720 

TCAAAGCCGTATGCGATGGCTGAGATACCGAAAGAATCAATTCTACAATGGGGACTGACG 
VSAYATDSMAFLVKMLPLTA- 

TCGTGAGCAGTGGTTAAAAGACGTGCTAGGATATCTGCTGGTACGGAGACGACCAGCAAA 

9721 + * + 1 + + 97B0 

AGCACTCGTCACCAATTTTCTGCACGATCCTATAGACGACCATGCCTCTGCTGGTCGTTT 
REQWLKDVLGYLDVRRRPAN - 

TTTTTCCTACGACGTAAGAGTAGCTTGGGTATATGACGTGATCGCTACGCTCAAGCTGGT 

9781 * + + + + + 9840 

AAAAAGGATGCTGCATTCTCATCGAACCCATATACTGCACTAGCGATGCGAGTTCGACCA 
F S YDVRVA WVYDV I A T LKL V - 

CATAAGATTGTTTTTCAACAAGGACACACCCGGGGGTATTA 

9841 * * + + 9900 

GTATTCTAACAAAAAGTTGTTCCTGTGTGGGCCC^ 
IRLFFNKDTPGGIKDLKPCV- 

GCCTATAGAGTCATTCGACCCCrrTCACGAGCTTTCGTCCTATTTCTCT 

9901 1 + + h + + 9960 

CGGATATCTOIGTAAGCTGGGGAAAGTGCTCGAAAGCAG 
P I ESFDPFHELSSYF SR LSY 

CGAGATGACGACAGGTAAAGGGGGAAAGATATGCCCGGAGATCGCCGAGAAGTTGGTGCG 

9961 + + + * + + 10020 

GCTCTACTGCTGTCCATTTCCCCCTTTCTATACGGC^ 
EMTTGJCGGKICPEIAEKLVR - 

CCGTCTAATGGAGGAAAACTATAAGTTAAGATTGACCCCAGTGATCGCCTTAATAATTAT 

10021 + •* + * * + 10080 

GGCAGATTACCTCCTTTTGATATTCAATTCTAACTGGGGTCACTAC 
RLMEENYKLRLTFVMALIII- 
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AC TGGTATAC TACTC CATTTACGGCAC AAAC GCTACCAGGATT AAAAGACG C CCGGA'l'TT 
3.00 S 1 * f-- - -r ■!• ■*■ 10140 

L V Y Y S I YGTNATRIKRRPDF - 

CCTCAATGTGAGGATAAAGwAAGAGTCGAGAAGGTTTCGTTACGGGGGOTAGAAGATCG 

10141 k'gttacactcctat^cccttctcagct^ 

iT n v r i kgrvekvslrgveor - 

tgcctttagaatatcagaaaagcgcgggataaacgctcaacgtgtattatgtaggtacta 

10201 » * * + + 10260 

acggaaatcttatagtottttcgcgccctatttgcgagttgcacataatacatccatgat 

AFRIS E KRGINAQRVLCRYY - 

tagcgatctcacatgtctggctaggcgacattacggcattcgcaggaacaattggaagac 
10261 atcgctagagtcta^gaccgatccgctgtaatgccgtaagcgtccttgtt^ 

S D LTCLARRHYGIRRNN.WKT- 
GCTGAGTTATGTAGACGGGACGTTAGCGTATGACACGGCTGATTGTATAACrTCTAAGGT 

X0321 * + * + + 10380 

CGACTCAA^ACATCTGCCCTGCAATCGCATACTGTGCCGACTAACATATTGAAGATTCCA 

lsy'vdgtlaydtadcitskv - 

GAGAAATACGAK^CACCGCAGATCACGCTAGCATTATACACTATA 

103 81 — — — — * +~ 

CTClTTATGCTAGTTC^GGCGTCTAGTGCGATCGTAATATCnX^TATAGTTCTGCYGCT 

rntintadhasiihyiktne 

AAACCAGGTTACCGGAACTACTCTAC^ 

10441 raGG^ccAATGGc^T^ 

NQVTGTTLPHQt-* 
GATGTTTCTCGTATTACOTTA^^ 

10501 ctac^gag^taatc^tat^aaIaattaacgaga 

QKF6 (Coat protsiix) 

gtgaacgcgatggcatttgaactgaaattagggcagatatat iog2o 

10561 cac^ktaccgtaaist^^ 

mafelklgqiyevvpem - 

AATTTGAGAGTTAGAGfrGGGGGATGCGGCACAAGG^ 

NLRVR VGDAAQGKF SKASFL - 
AAGTACGTTAAGGACGGGACACAGGCGGAATTAACGGGAATCGCC ^ 

10681 raTGC^rec^cc^cc^^ 

kyvkdgtqaeltgiavvpek 
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TACGTATTCGCCACAGCAGCTTTGGCTACAGCGGCGCAGGAGCCACCTAGGCAGCCACCA 

10741 + + + * 10800 

ATGCATAAGCGGTGTCGTCGAAACCGATGTCGCCGCGTCCTCGGTGGATCCGTCGGTGGT 

YVFATAALATAAQEPPRQPP 

GCGCAAGTGGCGGAACC\CAGGAAACCGATATAGGGGTAGTGCCGGAATCTGAGACTCTC 

10801 + + + + + 10860 

CGCGTTCACCGCCTTGGTGTCCTTTGGCTATATCCCCATCACGGCCTTAGACTCTGAGAG 

AQVAEPQETDIGVVPESETL 

ACACCAAATAAGTTGGTTTTCGAGAAAGATCCAGACAAGTTCTTGAAGACTATGGGCAAG 

10861 * 1 * + + + 10920 

TGTGGTTTATTCAACCAAAAGCTCTTTCTAGGTCTGTTCAAGAACTTCTGATACC 
TPNKLVFEKDPDKFLKTMGK 

GGAATAGCTTTGGACTTGGCGGGAGTTACCCACAAACCGAAAGTTATTAACGAGCCAGGG 

10921 + * + + + + 10980 

CCTTATCGAAACCTGAACCGCCCTCAATGGGTGTTTGGCTTTCAATAATTGCTCGGTCCC 
GIALDLAGVTHKPKVINEPG 

AAAGTATC^GTAGAGGTGGCAATGAAGATTAATGCCGCATTGATGGAGCTGTGTAAGAAG 

1098! + + + + * + 11040 

TTTCATAGTCATCTCC\CCGTTACTTCTAATTACGGCGTAACTACCTCGACACATTCTTC 
KVSVEVAMKINAALMELCKK 

GTTATGGGCGCCGATGACGCAGCAACTAAGACAGAATTCTTCTTGTACGTGATGCAGATT 

U041 + * * + * * 11100 

CAATACCCGCGGCTACTGCGTCGTTGATTCTGTCTTAAGAAGAACATGCACTACGTCTAA 

VMGADDAATKT EFFLYVMQI 
GCTTGCACGTTCTTTAC^TCGTCTTCGACGGAGTTCAAAGAGTTTGACTACA 

11101 H - + * * + 11160 

CGAACGTGCAAGAAATGTAGCAGAAGCTGCCTCAAGTTTCTCAAACTGATGTATCTTTGG 
AC TFFTSSSTEFKEFDYIET 

GATGATGGAAAGAAGATATATGCGGTGTGGGTATATGATTGCATTAAACAAGCTGCTGCT 

1H61 * + + + + + H220 

CTACTAC CTTTC TTCTATATAC GC CACAC C CATATACTAACGTAATTTGTTCGACG ACGA 
DDGKKIYAVWVYDCIKQAAA 

TCGACGGGTTATGAAAACCCGGTAAGGCAGTATCTAGCGTACTTCACACCAACCTTCATC 

11221 - + * * + + 11280 

AGCTGCCCAATACTTTTGGGCCATTCCGTCATAGATCGCATGAAGTGTGGTTGGAAGTAG 

STGYENPVRQYLAYFTPTFI 

ACGGCGACCCTGAATGGTAAACTAGTGATCAACGAGAAGGTTATGGCACAGCA 

11281 + * + + * 11340 

TGCCGCTGGGACTTACCATTTGATCACTACTTGCTCTTCCAATACCGTGTC 

TA T LN G K LVMNE KVMA QHGV 

CCACCG^TTCTTTCCGT^ 

11341 + * + * * 11400 

GGTGGCTTTAAGAAAGGCATGTGCTATCTGAC 

PPKFFPYTIDCVRPTYDLFN 
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AACGACGCAATATTAGCATGGAATTTACCTAGACAGCAGGCGTTTAGAAACAAGACGGTA 

11401 + + -t- + «- -i- 11460 

TTGCTGCGTTATAATCGTACCTTAAATCGATCTGTCGTCCGCAAATCTTTGTTCTGCCAT 
NDAILAWNLARQQAFP. NKTV 

ACGCCCGATAACACCTTACACAACGTCTTCCAACTATTGCAAAAGAAGTAGCTACGATCG 

11461 + * * + ^ 11520 

TGCCGGCTATTGTGGAATGTGTTGCAGAAGGTTGATAACGTTTTCTTCATCGATGCTAGC 
TADNTL HNVFQ L LQ K K "* 

ORF7 (CTr) 

ATGTCTATAAATTGGTGAAAAATTTAGAAATATTTACCTTTTATTGATAATTCATGG 

11521 + h + + + + 11580 

TACAGATATTTAACCACTTTTTAAATCTTTAT^^ 
M S I N W * 

M G A - 

CTTATACACATGTAGACTTTCATGAGTCGCGGTTGCTGAAAGA 

11S81 + + * «. + 11640 

GAATATGTGTACATCTGAAAGTACTCAGCGCCAACGACTTTCTGTTTGTTCTGATAGAAA 
YTHVDFHE SRLLKDXQDYL S - 

CTTTCAAGTCAGCGGATGAAGCTCCTCCTGATCCTCCCGGATACGTTCGCCCAGATAGTT 

11641 + * + + + + 11700 

GAAAGTTCAGTCGCCTACTTCGAGGAGGACTAGGAGGGCCTATGCAAGCGGGTCTATCAA 
FKSADEAP PDPPGYVRPDSY- 

ATGTGAGGGCTTATTTGATAC^AAGAGCAGACTTTCCCAATACTCAAAGCTTATCAGTTA 

11701 «• + + + + + 11760 

TACACTCCCGAATAAACTAT G TTTCTCGTCTGAAAGGG7TATGAGTTTCGAATAGTCAAT 
VRAYLIQRADFPNTQSLSVT- 

CGTTATCGATAGCCAGTAATAAGTTAGCTTCAGGTCTTATGGGAAGCGACGCAGTATCAT 

11761 — ■ + + * + + + 11820 

GCAATAGCTATCGGTCATTATTCAATCGAAGTCC^GAATACCCTTCGCTGCGTCATAGTA 
LSIASNKLASGLMGSDAVSS- 

CGTCGTTTATGCTGATGAACGACGTGGGAGATTACTTC^ 

11821 + + + + + + 11880 

GCAGCAAATACGACTACTTGCTGCACCCTCTAATGAAGCTCACGCCGCACACAGTGTTGT 
S FMLMNDVGDYFECGVCHNK- 

AACCCTACTTAGGACGGGAAGTTATCTTCTGTAGGAAATACATAGGTGGGAGAGGAGTGG 

11881 + + + + + + 11940 

TTGGGATGAATCCTGCCCTTCAATAGAAGACATCCTTTATGTATC 

PYLGREVI FCRKY I GGRGVE- 

AGATCACCACTGGTAAGAACTACACGTCGAACAATTGGA^ 

11941 + + + * * + 12000 

TCTAGTGGTGACCATTCTTGATGTGCAGCTTGTTAACCTTC 

I TTGKNYT SNNWNEASYVI Q - 

AAGTGAACGTAGTCGATGGGTTAGCACAGACCACTGTTAATTCT 

12001 + «■ + * + + 12060 

TTCACTTGCATCAGCTACCCAATCGTGTCTGGTGACAATTAAGATGAATA 

VNVV DG LA QTTVN S TYTQT D - 
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ACGTTAGTGGTCTACCCAAAAATTGGACGCGTATCTACAAAATAACAAAGATAGTGTCCG 

1206 1 - + + + 12120 

TGCAATCACCAGATGGGTTTTTAACCTGCGCATAGATGTTTTATTGTTTCTATCACAGGC 

VSGLPKNWTRIVKITKIVSV- 

TAGATCAGAACCTCTACCCTGGTTGTTTCTCAGACTCGAAACTGGGTGTAATGCGTATAA 

1-1121 * " + + * * 12180 

ATCTAGTCTTGGAGATGGGACCAACAAAGAGTCTGAGCTTTGACCCACATTACGCATATT 

DQN LYPGCFSDSKLGVMR IR- 

GGTCACTGTTAGTTTCCCCAGTGCGCATCTTCTTrAGGGATATCTTATTGAAACCTTTGA 

12181 + + + + + ' + 12240 

CCAGTGACAATCAAAGGGGTCACGCGTAGAAGAAATCCCTATAGAATAACTTTGGAAACT 
SLLVSPVRIFFRD I LLKPLK- 

AGAAATCGTTCAACGCAAGAATCGAGGATGTGCTGAATATTGACGACACGTCGTTGTTAG 
12241 + + + * 12300 

t^tttagcaagttgcgttcttagctcctacacgacttataactgctgtgcag 
"ksfnariedvlniddtsllv- 

TACCGAGTCCTGTCGTACCAGAGTCTACGGGAGGTGTAGGTCCATCAGAGCAGCTGGATG 

12301 + ♦ ♦ + + * 12360 

ATGGCTCAGGACAGCATGGTCTCAGATGCCCTCCACATCCAGGTAGTCTCGTCGACCTAC 

PSPVVPESTGGVGPSEQLDV- 

TAGTGGCTTTAACGTCCGACGTAACGGAATTGATCAACACTAGGGGGCAAGGTAAGATAT 

l 23e i + + «■ * * 12420 

ATCACCGAAATTGCAGGCTGCATTGCCTTAACTAGTTGTGATCCCCCGTTCCATTCTATA 

valtsdvtelintrgqgkic- 

GTTTTCCAGACTCAGTGTTATCGATCAATGAAGCGGATATCTACGATGAGCGGTATTTGC 

12421 + + + + + * 12480 

CAAAAGGTCTGAGTCACAATAGCTAGTTACTTCGCCTATAGATGCTACTCGCCATAAACG 

F PDSVLS INEADIYDERYLP- 

CGATAACGGAAGCTCTACAGATAAACGCAAGACTACGCAGACTCGTTCTTTCGAAAGGCG 

12481 * + * * * * 12540 

GCTATTGCCTTCGAGATGTCTATTTGCGTTCTGATGCGTCTGAGCAAGAAAGCTTTCCGC 

XTEALQINARLRRLVLSKGG- 

-GGAGTCAAACACCACGAGATATGGGGAATATGATAGTGGCCATGATAC^ 

12541 * * + ► + + 12600 

CCTCAGTTTGTGGTGCTCTATACCCCTTATACTATCACCGGTACTATGTTGAAAAGCATG 

S QTPRDMGNMIVAMIQLFVL- 

TCTACTCTACTGTAAAGAATATAAGCGTCAAAGACGGGTATAGGGTGGAGACCGAATTA 

12S01 < < * * + + 12660 

AGATGAGATGACATTTCTTATATTCGCAGTTTCTGCCCATATC^ 

YSTVKNISVKDGYRVETELG- 

GTCAAAAGAGAGTCTACITAAGTTATTC 

12661 . h * + < * 1"20 

CAGTTTTCTCTCAGATGAATTCAATAAGCCTTCATT 

QKRVYLSYSEVREAILGGKY- 
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ACGGTGCGTCTCCAACCAACACTGTGCGATCCTTCATGAGGTATTTTGCTCACACC\CTA 

12721 * + * 12780 

TGCCACGCAGAGGTTGGTTGTGACACGCTAGGAAGTACTCCATAAAACGAGTGTGGTGAT 

c GASPTMTVRSFMRYFAHTTI- 

TTACTCTACTTATAGAGAAGAAAATTCAGCCAGCGTGTACTGCCCTAGCTAAGCACGGCG 

12761 + * + * * 12840 

AATGAGATGAATATCTCTTCTTTTAAG7CGGTCGCACATGACGGGATCGATTCGTGCCGC 

c TLLIEKKIQPACTALAKKGV- 



c 



TCCCGAAGAGGTTCZACTCCGTACTGCTTCGACTTCGCACTACTGGATAACAGATATTACC 

. + — — 4— — — — — — — — + - — — — + -♦- 

AGGGCTTCTCCAAGTGAGGCATGACGAAGCTGAAGCGTGATGACCTATTGTCTATAATGG 



PKRFT P YC FDFALLDNRYYP- 



CGGCGGACGTGTTGAAGGCTAACGCAATGGCTTGCGCTATAGCGATTAAATCAGCTAATT 

12901 + — + * * * 12960 

GCCGCCTGCACA\CTTCCGATTGCGTTACCGAACGCGATATCGC7AATTTAGTCGATTAA 
c ADVLKANAMACAIAIKS ANL- 

ORF8 

TAAGGCGTAAAGGTTCGGAGACGTATAACATCTTAGAAAGCATTTGATTATCTAAAGATG 

12961 + + + * * 13020 

ATTCCGCATTTCCAAGCCTCTGCATATTGTAGAATCTTTCGTAAACTAATAGATTTCTAC 

M * 

c RRKGSETYNILESI* 

GAATTCAGACCAGTTTTAATTACAGTTCGCCGTC^^ 

13021 + + + H * 13080 

CTTAAGTCTGGTCAAAATTAATGTCAAGCGGCACTAGGGCC 

a EFRPVL JTVRRDPGVNTGSL 

AAAGTGATAGCTTATGACTTACACTACGACAATATATTCGATAACTGCGCGGTAAAGTCG 
13081 + + + * + * 13140 

tttcactatcgaatactgaatgtgatgctgttatataagctattgaccx:gccatttcagc 

a KVIAYDLHYDNIFDNCAVKS 

TTTCGAGACACCGACACTGGATTC^TGTTATGAAAGAATACTCGACGAAT^ 
13141 — — +■— — — ———4-—— 13200 

AAAGCTCTGTGGCTGTGACCTAAGTGACAATACTTTCTTATGAGCTGCTTAAGTCGCAAG 
a FRDTDTGFTVMKEYSTNSAF 

ATACTAAGTCCTTATAAACTGTTTTCCGCGGTCTTTAATAAGGAAGGTGAGATGATAAGT 

13201 1 + + + * + 13260 

TATGATTCAGGAATATTTGACAAAAGGCGCCAGAAATTATTCCTTCCACTCTACTATTCA 

a ILSPYKLFSAVFNKEGEMIS - 

AACGATGTAGGATCGAGTTTCAGGGTTTACAATATCTTTTCGCAAATGTGTAAAGATA 

13261 „+__— +• + ♦ * 13320 

TTGCTACATCCTAGCTCAAAGTCCCAAATGTTATAGAAAAGCGTTTACACATTTCTATAG 
a NDVGSSFRVYNIFSQMCKDI 

aacgagatcagcgagatacaacgcgccggttacctagaaacatatttag^ ^ BQ 

13 321 TTGCTCTliGTCCOT 
a NEISEIQRAGYLETYLGDGQ - 
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GC^GACACTGATATATTTTTTGATGTCTTAACCIAACAACAAAGCAAAGGTAAGGTGGTTA 

13381 -~ + * — -* 13 "° 

CGACTGTGACTATATAAAAAACTACAGAATTGGTTGTTGTTTCGTTTCCATTCC^CCAAT 

a AD TD I F F ' D V L TNNKAKVRWL 

GTTAATAAAGACCATAGCGCGTGGTGTGGGATATTGAArGATTTGAAGTGGGAAGAGAGC 

13441 + * * + * 13S00 

CAATTATTTCTGGTATCGCGCACCACACCCTATAACTTACTAAACTTCACCCTTCTCTCG 
a VNKDH SAWCG ILNDLKWEES 

AACAAGGAGAAATTTAAGGGGAGAGACATACTAGATACTTACGTTTTATCGTCTGATTAT 

135 0i k + * * + + 13560 

TTGTTCCTCTTTAAATTC CC CTCTCTGTATG ATCTATG AATGC AAAAT AGCAG ACTAATA 
a NK EKFKGRDILDTYVLSSDY 

ORT9 

CCAGGGTTTAAATGAAGTTGCTTTCGCTCCGCTATCTTATCTTAAGGTTGTCAAAGTCGC 

13561 < 4- * * * + 13620 

GGTCCCAAATTTACTTCAACGAAAGCGAGGCGATAGAATAGAATTCCAACAGTTTCAGCG 

a P G F K * ~ ' 

c MKLLSLRYLILRLSKS L- 

TTAGAACGAACGATCACTTGGTTTTAATACTTATAAAGGAGGCGCTTATAAACTATTACA 

1362 l + + * + + + 13 680 

AATCTTGCTTGCTAGTGAACCAAAATTATGAATATTTCCTCCGCGAATATTTGATAATGT 

c * RTNDHLVLIL IXEALINYYN- 

ACGCCTCTTTCACCGATGAGGGTGCCGTATTAAGAGACTCTCGCGAAAGTATAGAGAATT 

13 681 * * + + * + 13740 

TGCGGAGAAAGTGGCTACTCCCACGGCATAATTCTCTGAGAGCGCTTTCATATCTCTTAA 

c A S FT D EGAV L RD S RES I EN F - 

TTCTCGTAGCCAGGTGCGGTTCGCAAAATTCCTGCCGACT 

13741 + 4. + + + + 13 800 

AAGAGCATCGGTCCACGCCAAGCGTTTTAAGGACGGCTCAGTACTTCCGAAACTAGTGAT 
c LVARCGSQNSCRVMKALITN- 

ACACAGTCTGTAAGATGTCGATAGAAACAGCCAGAAGTTTTATCGGAGACI^ 

13B 01 « + - * + 13860 

TGTGTCAGACATTCTACAGC TATCTTTGTCGGTCTTCAAAATAGC CTCTGAATTATGAGC 
c TVCKMSIETARSF IGDLI L V - 

TCGCCGACTCCTCTGTTTCAGCGTTGGAAGAAGCGAAATCAATTAAAGATAATTTC 

138S1 + + + < * + 13920 

AGCGGCTGAGGAGACAAAGTCGCAACCTTCTTCGCTTTAGT^ 

c ADSSVSALEEAKS IKDNFRL- 

TAAGAAAAAGGAGAGGCAAGTATTATTAIAGTGGTGATTGTGGATCC^ 

1392 1 ♦ + + -+ + + 13980 

ATTCTTTTTCCTCTCCGTTCATAATAATATCACCACTAACAC 
c RKRRGKYYYSGDCGSDVA XV- 

TTAAGTATATTTTGTCTGGGGAGAATCGAGGATTGGGGTGCGTAGA 

. 13981 + * 4- + h + 14040 

• AATTCATATAAAACAGACCCCTCTTAGCTCCTAACCCCACGCATCTAAGGAACTTCGATC 
c KYILSGENRGLGCVDSLKLV- 
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TTTGC3TAGGTAGACAAGGAGGTGGA-AACGTACTACAGCACCTACTAATCTCATCTCTGG 

14041 h * + ^ * .14100 

AAACGCATCCATCTGTTCCTCCACCTTTGCATGATGTCGTGGATGATTAGAGTAGAGACC 
CVGRQGGGNVLQHLLISSLG- 

ORF10 

GTTAAAGCATCATGGACCTATCGTTTATTATTGTGCAGATCCTTTCCGCCTCGTACAATA 

14101 + + + + + + 14160 

CAATTTCG TAGTACCTGGATAGCAAATAATAAC ACGTC TAGGAAAGG C GGAGC ATGTTAT 

MDLSFIIVQILSASYNN- 

ATGACGTGACAGCACTTTACACTTTGATTAACGCGTATAATAGCGTTGATGATACGACGC 

14161 + + •* + + 14220 

TACTGCACTGTCGTGAAATGTGAAACTAATTGCGCATATTATCGCAACTACTATGCTGCG 
DVTALYTL INAYNSVDDTTR- 

GCTGGGCAGCGATAAACGATCCGCAAGCTGAuGTTAACGTCGTGAAGGCTTACGTAGCTA 

14221 + + «■ + : + 14280 

CGACCGGTCGCTATTTGCTAGGCGTTCGACTCCAATTGCAGCACTTCCGAATGCATCGAT 
WAAINDP Q A EVN VVKAYVAT- 

CTACAGCGACGACTGAGCTGCATAGAACAATTCTCATTGACAGTATAGACTCCGCCTTCG 

14281 + * + + + f 14340 

GATGTCGCTGCTGACTCGACGTATCTTGTTAAGAGTAACTGTCATATCTGAGGCGGAAGC 
TATTELHRT ILIDS I D S A F A - 

CTTATGACCAAGTGGGGTGTTTGGTGGGCATAGCTAGAGGTTTGCTTAGACATTCGGAAG 

14341 + + + + «■ + 14400 

GAATACTGGTTCACCCCACAAACCACCCGTATCGATCTCCAAACGAATCTGTAAGCCTTC 
YDQVGC LVGIARGLLRHSED- 

ATGTTCTGGAGGTCATCAAGTCGATGGAGTTATTCGAAGTGTGTCGTGGAAAGAGGGGAA 

14401 -t- + + + + + 14460 

TACAAGACCTCCAGTAGTTCAGCTACCTCAATAAGCTTCACACAGCACCTTTCTCCCCTT 
VLEVI KSMELFEVCRGKRG S - 

GCAAAAGATATCTTGGATACTTAAGTGATCAATGCACTAACAAATACATGATGCTAACT^ 

14461 + + + + + + 14520 

CGTTTTCTATAGAACCTATGAATTCACTAGTTACGTGATTGTTTATGTACTACGATTGAG 
KRYLGYLS DQCTNKYMMLTQ- 

AGGCCGGACTGGCCGCAGTTGAAGGAGCAGACATACTACGAACGAATCATCTAGTCAGTG 

14521 + + + + * + 14580 

TCCGGCCTGACCGGCGTCAACTTCCTCGTCTGTATGATGCTTGCTTAGTAGATCAGTCAC 
AGLAAVEGADI L RTNHLVSG- 

GTAATAAGTTCTCTCCAAATTTCGGGATCGCTAGGATGTTGCTCTTGACGCT 

14S81 -i + + * * * 14640 

CATTATTCAAGAGAGGTTTAAAGCCCTAGCGATCCTACAA 

NKFSPNFG IARMLLLTLCCG- 

GAGCACTATAAAAATGTTATGTTGTTCAGCCAGTCT 

14641 h * + -i * * 14700 

CTCGTGATATTTTTACAATACAACAA 
A L * 
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ATCGCTACTTATTTGCGCATGTTTGTTAGCGG7GCTAATTGTTAGCTTTTGTAGAAGGCG 

14701 + * + **" + 14760 

TAGCGATGAATAAACGCGTACAAACAATCGCCACGATTAACAATCGAAAACATCTTCCGC 

ORF11 

ATGAGGCACTTAGAAAAACCCATCAGAGTAGCGG7ACACTATTGCGTCGTGCGAAGTGAC 

14761 + + + + 14820 

TAC7CCG7GAA7C7TT7TGGG7AGTC7CATCGCCATG7GATAACGCAGCACGCTTCACTG 
MRKLEKPIRVA. VHYCVVRSO 

GTTTGTGACGGGTGGGATGTATTTATAGGCGTAACGTTAATCGGTATGTTTATTAGTTAC 

14821 + + + * + + 14880 

CAAACACIXXCCACCCTACATAAATATCCGCATTGCAATTAGCC^^ 
VCDGWDVF IGVTLIGMFISY 

TATTTATATGCTCTAATTAGCATATGTAGAAAAGGAGAAGGTTTAACAACCAGTAAT^ 

14881 «► + + * + * 14940 

ATAAATATACGAGATTAATCGTATACATCTTTTC 

YLYALISICRKGEGLTTSNG 

TAAAAATCCTTCAATAAATTTGAAATAAACAAAAGTAAGAAAAATGAAA 

14941 + + + + + + 15000 

ATTTTTAGGAAGTTATTTAAACTTTATTTGTTTTCATTCTTT^ 

GTCTTTTTGTTCGTCTTTCGCTTTTGTAGAATAGGTTTTA 

15001 + + + + + + 15060 

CAGAAAAACAAGCAGAAAGCGAAAAC^TCTTATCCAAAATAAAGCTCCATTCTACTGATT 

ACTCTACCTCACGGTTTAATACTCTGATATTTGTAAAATTAGTCCGTAAAGTCAGATAGT 

15061 * * * + + + 15120 

TGAGATGGAGTGCCAAATTATGAGACTATAAACATTTTAATCAGGCATTTCAGTCTATCA 

GATATTATATTAGTATAGTATAATAAACGCCAAAATCCAATCAAAGTTTGGC^ 

15121 +■ + + + * * 15180 

CTATAATATAATCATATCATATTATTTGCGGTTTTAGGTTAGTTTCAAAC 

GGGCCTCTTATGAGGCTAACTTATCGACAATAAGTTAGGTCCGCCAC 

15181 + + + + 15227 

CCCGGAGAATACTCCGATTGAATAGCTGTTATXCAATCCAGGCGGTG 
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TABLE 5 

Amino Acid Seamence Alignment for Helicases Encoded by 
GLRaV-3, BYV, "CTV and LIYV. 



_r(A). 



BYV_HZL FTFTNLSANV LLYEAPPGGG KTTTLIXVFC ETFSK.VNSL I L TANKS S RE 

CTVJiEL LTFTNEEHSL XVYEAPPGGG KTHSLVNSYA DYCVK.VSCL WTANKNSQT 

GLRaV3_HEL VGESFKSFSY KCYNAPPGGG KTT MLV DEFVKSPNST ATITANVG3S 

LXYV_EEL MVRRPDVNGL KFYNKPPGAG KTTTLAJOMS KDLKNKVKCL ALSYTKVGRL 

CONSENSUS — Y-aPPGaG KTt d-f-Jc-v— 1 



BYV.HZL EILAKVNRIV LD. . . EGDTP LQTHDRXLTI DSYLMNNR.G LTCKVLYLDE 

CTV_HEL EISQRISNEL MGRKLAAKYV TDAASRVFTV DSYLMNHL.R LTTQLLFIDE 

GLRaV3_HEL EDXNM. . . .A VKKR. . . DPN LEGI*NSATTV NSRWNFIVR. GZ*YKRVLVDE 

LIYV_HEL ELXDKLKKDG IEKP...EKY VKTYDSFLMN NDNXLSXV. KLYCDE 



CONSENSUS E- 



-lcv -s — mn- 



— ly-DE 



IXI 

BYV.HEL CFMVHACAAV ACXEFTKCDS AILFGDSRQI RYGRCSELDT AVLSDLNRFV 

CTV_HEL CFMVHAGAIG AWEFTSCKA WFFGDSKQI KYXHRNDLGV SFVADIDAFX 

GLRaV3_HEL VYMMHQGLLQ LGVTATGASE GLFFGDINQI PFINREKVFR MDCA. .VFVF 

liyvIhel VFMMHAGHFL TLLTKXAYQN GYCYGDVNQI PFINRDPYTP AYLS. .REFF 

CONSENSUS - f M-HaG c ££GD-- QI — i-r f- 



IV 

BYVJiEL DDESRVYGEV SYRCPWDVCA WLSTF YPKTVAT TNLVSAGQSS 

CTV_HEL QPEHRXYGcV SYRCPWDICE WLSEF YPRHVAT ANVGSIGKSS 

GLRaV3_HEL KKESWYTSK SYRCPLDVCY LLSSMTVRGT EKCYPEKWS GKDK.PWRS 
LIYV_KEL RKQDLNYDTY TYRCPLDTCY LLSNLKDEMG NIIYAGGVKN VNEVYPTIRS 

CONSENSUS — e--vY sYRCP-DvC- -LS-f Yp--V — -n S 



RW_H7T. MQVREIESVD DVEYSSEFVY LTMLQSEKXD LLKSFGK - „ R SRSSVEKPTV 

CTV.HEL VSIEEENGCD DVPYDKAAKY IVYTQAEKND LQKHLGRLTV GRNKV.VPIV 

GLRaV3~HZL LSXRPIGTTD DVAEINADVY LdTQLEKSD MKRSLKGKGK .ETP.„;..V 

LIYV_HEL LNLFGINWG EVPVEYNAKY LTFTQDEKLN LQRKXDSQGG CRN A V 

CONSENSUS 1 1 d dV Y L — tQ-EX-d 1 1 -r V 



BYVJIEL, 
CTV_HEL 
GLRaV3_HEL 
LICTJHEL 



LTVHEAQGET 
NTVHEVQGET 
MTVHEAQGKT 
STVNEAQGCT 



YRXVNLVRTK 
YKRVRLVRFK 
FSDWLFRTK 
FSEVNLVRLV 



FQEDDPFRSE NKITVALSRH VESLTYSVLS 
YQEDTPFSSSC NHXWALTRH VDSLVYSVLT 
KADDSLFTKQ PHXLVGLSRH TRSLVYAALS 
QFDNPVMSDI NQFWAISRH TTTFKYFTPH 



CONSENSUS -TVh£aQG-T V-LvR-Jc d — £ nhi-ValsRH — sl-Y — 1- 



BYV_HZL SKRDDAIAQA I 

CTV_HEL SRRYDDTATN I 

GLRaV3_KEL SKLDOKVGTY I 

LIYV_HZL SRLNDRVSNA I 

CONSENSUS S— D-v I 
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TABLE 6 





Hclicasc 


RdRp 






HSP70 


HSP90 


CP 




Vims 


m 


aa 


at 


aa 


at 


aa 


at 


aa 


DC 


aa 


nt 


aa 


BYV 


37.7 


38.4 
(58.1) 


44.5 


41 J! 
(61.0) 


42.0 


30.4 
(47.8) 


43.5 


28.6 
(48.0) 


40.5 


21.7 
(51.0) 


41.5 


20.3 
(43.7) 


CTV 


453 


36.3 
(55.2) 


44.0 


40.1 
QSL2) 


42.8 


20.0 
(48.9) 


43.7 


28.7 
(493) 


38.6 


17.5 
(43.5) 


40J 


20.5 
(41.9) 


LIYV 


44.9 


32.4 
(53.5) 


462 


35.9 
(56.4) 


45.8 


17.9 
(46.2) 


43.9 


28.2 
(46.9) 


39.3 


16.7 
(36.8) 


36.3 


17.8 
(41.1) 



Nucleotide (nt) and amino acid (aa) sequence similarity was 
calculated from perfect matches after aligning with the GCG 
program GAP; the* percentages in parentheses are the 
percentages calculated by the GAP program, which employs a 
matching table based on evolutionary conservation of amino 
acids (Devereux et al., wnrlPir Acids Res.. 12:387-395 
(1984) , which is hereby incorporated by reference) . The 
sources for the BYV, CTV, and LIYV sequences were, 
respectively, Agranovsky (1994), Karasev (1995), and Klaassen 
(1995) , which are hereby incorporated by reference. 



TABLE 7 



Sample # 


Accession # 


EL ISA * 


RT-PCR 


Indexing 


1 


476.01 


1.424 (+) 


+ 


+ 


2 


447.01 


0.970 (+) 


+ 


+ 


3 


123.01 


1.101 (+) 




+ 


4 


387.01 


>1.965 (+) 




+ 


5 


80.01 


>2.020 (+) 






6 


244.01 


>2.000(+) 


+ 


+ 


7 


441.01 


>2.000 ( + ) 




+ 


8 


510.01 


0.857 (+) 


+ 


+ 


9 


536.01 


0.561 (+) 




+ 


10 


572.01 


>2.000{+) 


+ 


+ 


11 


468.01 


>2 .000 (+) 


+ 


+ 


12 


382.01 


>2.000 (+) 


+ 


+ 


13 


NY1 


0.656 (+) 


+ 


+ 


14 


Healthy 


0.002 (-) 







Plus (+) and Minus {-) represent positive and negative reactions, 
respectively. For ELISA an 0D 4MM . that was at least twice higher 
than a healthy control, and more than 0.100 was regarded as 
positive. 
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TABLS 8 



Amino Acid Seauencs Alignment of RKA- dependent RNA 
Polymerase (RdRP) of GLRaV-3, BYV, CTV and LIYV. 



II. 



BYV_RdP.o ITTFKLtfVKR DAKVKLDSSC LVXHPPAQNI HFHRKAVNAI FSPCFDEFXN 

CTV_RdRp ISNFKLMVKR DAKVKLDDSS LSKKPAAQNT MFHKXFINAI FSPCFDEFKN 

GLRa V3 _RdRp LTSYTLMVKA DVKPKLDNTP LS2CYVTGQNI VYKDRCVTAL FSCIFTACVE 

LIYV_RdRp FKTLNLMVKG ETXFKMDLS? YDSYNAPANI VYYQQIVNLY FSPIFLZCFA 

CONSENSUS f-LMVX- d-K-KlD-s- 1-k qjNI h vna- FSp-F-e 



BYV.RdRp 
CTV_RdRp 
GLRaV3_RdRp 
LIYV.RdRp 



-HI. 



_IV 



RVTTCTNSNI VFFTEMTNST LASIAKZMLG . SEHVYNVGE IDFSKFDKSQ 

RVLSSLNDNI VFFTEMTNAG LAEIIRRIIG . DDDNLFVGE VDFSKFDKSQ 

RLKYWDERW LFYHGMDTAE LAXALRNNLG . DIRQYYTYE LDIS2CYDKSQ 

RLTYCLSDKI VLYSGMNTDV LAELIESKX.P LGLNAYHTLZ IDFSKFDKSQ 



CONSENSUS R- 



-d-i vf M LA Ig y E iDfSKf DKSQ 



BYV_RdRp 
CTV_RdRp 
GLRaV3_RdRp 
LIYV_RdRp 

CONSENSUS 



BYV_RdRp 
CTV_RdRp 
GLRaV3_RdRp 
LIYV_RdRp 



DAFIKSFERT LYSAFGFDED LLD.VWMQGE YTSNATTLDG QLSFSVDNQR 
DLFIXEYERT LYSEFGFDTE LLD.VWMEGE YRARATTLDG QLSF SVDGQR 
SALMKQVEEL ILLTLGVDRE VLS.TFFCGE YDSWRTMTK ELVL SVGSQR 
GTCFKLYEEM MYKMFGFSPE LYDRDFKYTE YTCRAKA.TC GVDLELGTQR 

~f-K-yE~ ly--fG£d-e lid gE Y a-tl 1 — sv — OR 



_VT. 



KSGASNTWXG NSIETLGILS MFYYTNRFKA LFVSGDDSLI FSESPIRNSA 
RSGGSNTWIG NSLVTLGILS LYYDVSKFDL LLVSGDDSLI YSSEKISNFS 
RSGGANTWLG NSLVLCTLLS WLRGLDYSY IWSGDDSLI FSRQFLDIDT 
RTGSPNTWLS NTLVTLGMML SSYDIDDIDL LLVSGDDSLI FSRKHLPNKT 



CONSENSUS rsG — NTW-G Ns lvtlg-ls ™y £~ 11VSGDDSLI fS- 



BYV_RdRp 
CTV_RdRp 
GLRa V3 _RdRp 
LIYVJldRp 

CONSENSUS 



DAMCTELGFE 
SEICLETGFE 
SVLSDNFGFD 
QEINKNFGHE 



VII 

TKFLTPSVPY FCSXFFVKTG 
TKFMSPSVPY FCSKFWQTG 
VKIFNQAAFY FCSXFLVQVE 
AKYIEKSSPY FCSXFIVELN 



VIII_ 



HDVF FVPDPY KLLVKLGAS. 
NKTCFVPDPY KLLVKLGAP. 
DSLFFVPDPL KLFVXFGAS. 
GXLKVTPDPI RFFEKLSIPI 



fGfe -Kf s-PY FCSKF-V fvPDP- kl-vKlga- 



BYV_RdRp . .KDEVDDEF LFEVFTSFRD LTKDLVDERV IELLTHLVHS KYGYESGDTY 

CTV RdRp . .QNKLTDVE LFELFTSFKD MTQDFGDQW LEXLKLLVEA KYGFASGTTM 

GLRaV3~RdRp . .KTSDID.L LHEIFQSFVD LSXGFNREDV IQEUVKLVTR KYX.HSGWTY 

LrYV^RdRp RQEDFVNGSV VXERFISFKD LMKEYDNDVA VXRIDEAVCY RYSIPVGCSY 

CONSENSUS d— 1-E-F-SF-D l-kdf--e-v i~l--lV-- kY sG-ty 



BYV_RdRp AALCAIHCIR SNFSSFKKLY 



CTV_RdRp 
GLRaV3_RdRp 
LIYV_RdRp 



PALCAIKCVR SNFLSFERLF 
SALCVLHVLS ANFSQFCRLY 
AALCYIKCCM SNFVSFRRIY 



CONSENSUS -ALC-iHc — sNF-sF-rly 
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TA3L2 9 

Nucleotide Sequence Comparison of GLRaV-3 and LIYV in 
Vicinity of Frameshif t 



GLAaV-IJHZL spqsvsdall 

CUUV-3_KdRp TRSPR^VAPCVY 
CLJUV-3 (atl TCg Ccc CAA TCa CTa tec GAc acILXTa^Se Ac* cgrT teg ccc ccg ctg gcT GCt TTc GAg era TAT 
LIYV (nt) TCa Ct» CAA TCr CTt *gt cxt- »~ ^ Amtr mcT atc , ct tta ffac arr ^ m ^ 

LITV.HEL S L Q S V SDFVL >**g TAT 

LITV^RdRp KTIILDSAFDVY 



TABLE 10 



Amino Acid Sequence Alignment of Small Hydroohobic 
Transmembrane Proteins of GLRaV-3 (p5k) and BYV (r>6X) 
CTV (p6K) and LIYV (pSK) . 



1 54 

HYV _p7K MDCVLRSYLL LAFGFLICLF LFCLWFIWF WSCQILTKIT AQSNEAHKKH STW 

LIYV.PSK MSILLFFL KSXLVWt 1±T IUXLFVNTD SZVNIFNKSR F* . . . 

GLR&V3 _p5K HDD FKQAIIXLW iii-uv.u I i ttt. VLTPWPRLQ QSSTIWrGLR TV* . . 

CTV_p6K MDCVIQGXT ILVCIAVFCA ERGLUTVTT IYacrTJO'VR SASFYCIHAT V* . . . 



CCHSZHSUS 



f U-f lvi-i— 
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TABLE 11 

Amino Acid Sequence Alignment of ASP70 -related protein 
of GLRaV-3 with BYV (p£5K) , CTV (p65K) and LIYV {p62K*) . 



BYV_p65 . . MWFGLDF G7TFSSVCAY VGEELYLFXQ RDSAYZPTYV FLESDTQEVA 

CTV_p65 , .MVLLGLDF GTTFSTVAMA TPSELVILXQ SNSSYIPTCL LLEAEPNSVS 

GLRaV3_p59 . . .MEVGXDF GTTFSTICFS PSGVSGCTPV AGSVYVETQI FZPEGSSTYL 

LXYV_po2 MRDCKVGLDF GTTFSTVSTL VNNSI4YVLRL GDSAYTPTCZ AITFGGSAX. 



CONSENSUS G1DF GTTFStv- 



-1—1- 



— S-YipTci f- 



. BYV_p65 FGYDAEVLSN DLSVRGGFYS. DLKRWIGCDE ENYRDYLZKL KPHTCTELLX 

CTV_p65 YGYDAEYLAA S.GZSGSFYX DLKRWVGCTA KNYQTYLHXL SPSYKVTVKE 

GLRaV3_p55 IG.KAAGXAY RDGVEGHLYV NPKRWVGVTR DNVERYVEXL KFTYTVXM. . 

LXYV_p62 IGGAAHVLSG DDTPHCFFY . DUCKWVGVDD NTFXrAHNKE RPKYVAELVE 



CONSENSUS * -G — Ae-1 g-fY- dlKRWvG ny — yl-Kl -P-y i m 



BYV_p65 VAQSSKSTVK LDCYSGTVTQ NATLPGLIAT FVKALISTAS EAFKCQCTGV 

CTV_p65 FGTXSVPVPY LSPLNNDLGL SVALPSLIAS YAXSILSDAE RVFNVSCTGV 

GI*RaV3_p59 ...DSGGALL IGGLGSGPDT LLRWDVTCL FLHALILECS RYTSTTVTAA 

LIYV_pS2 GEVY LTGXNKGFSI KLSVKQLIXA YXETXVRLIiA SSYSLRVTDL 



CONSENSUS 



-IX- 



B_ 

BYV_p65 ICSVPANYNC LQRSFTESCV NLSGYPCVYM VNEPSAAALS ACSRIKGATS 
CTV_p65 ICSVPAGYNT LQRAFTQQSX SMSGYSCVYI INEPSAAAYS TLPXLNSADK 
GLRaV3_p59 WTVPADYNS F3HSFWEAX* KGLGIPVRGV VNEPTAAALY SLAKSRVEDL 
LIYV_p62 NQSVPADYXN AQRLAARSVL KAJLSFPCRRI INEPSAAAVY CVSRYPNYNY 



CONSENSUS i-sVPA-Yn- IqR-f gypc— i -NEPsAAA- 



BYV_p65 FVLVYDFGGG TFDVSVISAI* NNTFWKASG GDMNLGGRDI DXAFVEHLYN 

CTV_p65 YLAVYDFGGG TFDVSIVSVR LPTFAVRSSS GDMNLGGRDI DKKLSDKXYE 

GLRaV3_p59 LLAVFDFGGG TFDVSFVK2GC GNILCVTFSV GDNFLGGRDI DRAXVEVTKQ 

LIYV _P 62 FL.VYDFGGG TFUVSLIGKY KSYVTVXDTE GDSFLGGRDI DKSIEDYLVG 



CONSENSUS -1-VyDFGGG TFDVS f-V— s- GD — LGGRDI Dk- 



BYV_p65 KAQ . . . LFVN YJCEDISFUCZ SLSKXVSFLN FPWSEQGVR VDVLVNVSZL 

CTV_p6S HAD...FVPQ KELNVSSLXE ALSLQTDFVK YT.VNHYGIiS ETVSIDQTVL 

GLRaV3_p59 KXKGKASDAK LGIFVSSMKE DLSNNNAITQ HLIFVEGGVE V.VDLTSDEL 

LXYV_p62 KYNHOCVTP. .ATYLALXKE E.CNNTNKSI FTH*FDDGSV QWEFSKSEL 



CONSENSUS Jc vs-IKE -la f-i— e-G V eL 



BYV_p65 AEVAAPFVER TIZIVKEVY. .ETYCSSMRL EPNVKAKLLM VGGSSYLPGL 

CTV j6S REIASVFXNR TIDILTQV. . . .SCVXSSHPE SQSL- -KLW VGGSSYLPGL 

CZJUV3 j59 DAZVAPFSAR AVTVFXTGP. . DNFYPDPVZ A VM TGGSSALVKV 

LZYVjM EKCVRPFVER SZXLI2XJWV R2I7XTSG" / XY^ VGGS3LLQPV 



CONSENSUS — i — pFv-R -i-i v Jc — s n vGGSS-L 
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HYV_p65 LSRLSSIPFV DEC.L.VL2D ARAAVAGGCA LYSACLKNDS PMLLVECAAK 

C7V_p65 LDALATVP5V SGI.V.PVED ARTAVARGCA LYSECLEGRS KALLIECZTH 

GLRaV3_p59 HSDVANLPQI SKV.VFDSTD FRCSVACGAX VYCDTLAGNS GLRLVDTLTN 

LrYV_p62 QDMVRSYAST KGLTLVADQD MRSAVSYGCS VLS.LEDNK EtVYXDCNSK 

CONSENSUS p-v D -R-aVa-Gc- -y L s 1-Dc h 



G 

BYV_p65 NLSISSXYCZ 
CTV_po5 HLSVTTFSAD 
GLKaV3 _p59 TLTDEWGLQ 
LIYV_p62 PLSDISFNCD 

CONSENSUS -Ls d 



SIVCVPAGSP IPFTGVRTVN 
SWVAAAGS? IFFEGERKLT 
FWTFPKGS? XPCSYTERYT 
PEPIITJCPMS IPYTHTVKMR 

-wi— -gsp IPf 



MTGSNASAVY SAALFEGDFV 
LRXCVSTSNY QARMFZGDYZ 

V GGGDV VYGIFEGE . . 

KDRPLKT. . . rVNTYEGSNL 

fEGd— 



H 

BYV_p65 KCRIaNKRTFF GDWLGNVGV TGSATRTVTL TLEINVSSVG TISFSLVGPT 
CTV_p65 KVFRNEHXYA ASVSLFTXGV NWSVPNDVEK TLVTKVDSMG KVEFYLXGPS 

GLRaV3_p59 UNHAFL NEPTKRGVSX RRGDPVETDV A.QFNLSTDG TVSVTVNGEE 

LXYV_p62 FMPENDWLIS SNXNTTDFAK VGZEY SKVYEYDIOG IITLKXRNEV 



CONSENSUS 



-N- 



-f g- 



BYV_p65 GVKXLIGGNA AYDFSSYQLG ERWADLKKK NSDKVXLIEA LTYQPFQRKK 

CTV_p65 GZLVNVQGTS HYDYAGHPHP THKLVKLSDY NVNSAAXVLA LTLTREHREK 

GI»RaV3_p59 VKNEYLVPGT TNVLDSL VYXSGKE DLEAKAIPEY LTTLNILHDK 

LXYV_p62 TGXMFTLPNS FTXSDNIXPI TFKLTQLSNT D . DLATL.TSL LGYHDKNFER 



CONSENSUS 



-ek 



BYV_p65 LTDGDKALFL KSLTADYKRE 

CTV_p6S FLLKT . . . LF DTLLADLRKT 

GLRaV3_p59 AFTRRNLGNK DKGr SDLRXE 

LXYV_p62 FYG L FNVPTILIZE 

CONSENSUS f 1 dlr-e 



AHKFSST DDAV LNSSEIXLGR 

A.SLSEYSKX YPITRNDIEV VSSK. . .KGI 

ENFUCS AVDTDTI LNG* 

XDKLGGr KTlt YKRLKSMNAN F 



BYV_p6S IIPKILRGSR VEKLDV* 

CTV_p65 WSKVLRCSD LERXPL. 

GULaV3_pS9 

LIYV_p62 1 

CONSENSUS 
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TABLE 12 

Amino Acid Sequence Alignment of HSP90-like Protein 
(p55) of GLRaV-3 with p61K of CTV, p59 of LIYV and P64 
of BYV. 



BYV_p64 llSJVGCKFNIQSVTKt V KJLEJGNVAEP SLVEH07SLSNSCGELINPKDTKRFV 

CTV_p 5 ^ 108 ~VUCRFTIjNDVESYI*M£RGZDFA^ 

LIYV_P 5 9 13 iZeGCSFTEQQVVZKYPQVDSLVAKIL YRVCNSLGXLLDLKDFENKN 

GLRaV3_p55 114lvDSNLPK3aDRDDXME. . ASRRLS? SDAAFCRAVSVQVGKYVDVTQNLEST ' 

CONSENSUS vgc- f v-e a w — sns -g- 1 d 



BYV_p64 
CTV_p61 
LIYV_P.S9 
CLRaV3_p55 

CONSENSUS 



SLXTKGKDLAESTDEAXVS - . SSYLDYLSHCLNLYETCNLSSNSGJCKSLY 
TLVF . TKNFDSNISG . -VT. . TKLETYLS YC I S L YKKKCM . KDDDYFNLI 
ISGFEXNTAQDSPTV3UDDN . .ES.NDFFRECVNDQRYYSSLSGSXLGKAK 
IVPI^VMEIKKHRGSAHVSIiPKWSAYVDFYTNI^ELLS 



-yl- 



BYV_jp 6 4 
CTV_p$l 
LIYV.PS9 
GLRaV3_p55 

CONSENSUS 



DEFLKKVTDYL. . • ENSDLEYHS ? SDN P LVAG ILYDMCFEYNTLKSTYXJC 
LPSlFNCUlKVLt . . . ASI^FYEKHADNPLLTGZILIEFCLENKVYYSTFKV 
LEANAYTFKIIiLKSASGEFDIDRLSRNP LAI SKFMNLYTNHVTDS ETFKS 
SAYATDSHAFLVKKLPLT AREQWLKDVLGYLLVRRRPANFSYDV 



-npl- 



-1— lc- 



BYV_pS4 
CTV_p61 
LIYV_PS9 
GLR&V3 _pS5 

CONSENSUS 



NTESFDCFLSI.VLPLLSEVFSMNWERPAPDVRLLFEI.DAAELLLiCVPTIN 
NLDNVrU^FKSKVT-PVVLTVWDISEPDDPVDERVLIPFDPTDFVLDLPKLN 

KFEALKSXKrPFASFIKXAFGIR LNFEDSXIFYALPKER 

RVAWVYDVXATLKLVJRrJFFNKDTPGGIKDLKPCVPIESFDPFKELS • . . 



-d-f--lp- 



BYV_p64 
CTV_p61 
LIYV.P59 
GLRaV3_p55 

CONSENSUS 



MHDST . . . FX.Y5OTO^YLESYFH3DSNEI*XKVTCVDSLL 

IHDTM VWGNQIRQLEYWESDALDDLSQHVDLRL 

QSDVLSDDMMVESXVTOAASFTWSDNNYLPH^VDR^ 
. . SYFSRLSYEKTTGKGGKICPEIAEKL 



-vd--l 
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TABLE 13 

Amino Acid Sequence Alignment of ^^"-J^"*, 
Protein of GLRaV-3 with BYV p€4K, CTV p61K and Lirv 

p59K. 



BYV nfil MTTRFSTPAN YYWGELFRRF FGG<3EW KNLMSE AASVSRPRYS 

Shrill ^ MSSH HVWGSLFRKF YGEAIW KEYLSE STRNFDERNV 

LIYVPS9 " *MLNDRXAV TCFQTLLXKS NVKHEMSQTN NYIVNNIADI NROTFPALAC 

GLRaV3lpS5 .MDK YXYVTGI . - L NPNEARDEVF SWNZGYIG? GGRSFSNRGS 



CONSENSUS 



-If— f 



BYV ofil S DFRTSDGV ILSRXTFGES TGES..FVRE FSLLLTFPKT YEVCXLCGVA 

?Z-Z 6 l SLDHTLSSGV VVRRQSLLNA PQGT . . FENE LALLYNSWT NDFVELTGMP 

L SJi59 KSSfJsDY YISGCQIWS PKDSNAYVKL LIVYLKYrYX N.YSAKTKYP 

CLRa^IpSS KYTVVWEN. . . . SAARISGF TSTSQSTIDA FAYFL LKGGLTTT 



CONSENSUS 



-fv- 



-11- 



BYV_p61 
CTV_p61 
LIYV_P59 
GLRaV3_p5S 



HELALNGHN. .RLSDYNVSE FN IV DVKTVCCKFN IQSVTEFVKK 

SsSciSD RKVPD....E LI SV DPHEVGCRFT LNDVESYLMS 

poslSvldy dsfkakwvky ldjcsltdyld dnktegcsft eqqwexypq 

LSNPINCENW VRSSXDLSAF FRTLIKGTOY ASRSVDSNLP KKDRDDIME. 



CONSENSUS 1 — l r 



d vgc-f- v-e 



BYV_p61 
CTV_p61 
LIYV_PS9 
GLRaV3_p5S 

.CONSENSUS 



INGNVAEPSL VEKCWSLSNS CGELINPKDT KRFVSLIFKG KDLAESTDc--. 

rgzdfadlaa vehswclsns CSRLLSSTEI DANKTLVF.T KNFDSNISG. 

Sds^vST -yrvcns lgxlldlkdf enknisgfei NTAQOSPTVA 

^Srhlspsd aafcsavsvq vgkyvdvtqn lestivplkv MEIKXRRGSA 



-sns 



BYV_pfil 
CTV_p61 
LIYV_P59 
GLRaV3_p55 



XVS..SSYLD YLSHCLNLYE 
.VT..TKLET YLSYCISLYK 
DDN..ES.ND FFRECVNDQR 
HVSLPKWSA YVDFYTNLQE 



TCNLSSNSGK KSLYDEFLKH VIDYL. - .EN 
KHCM.KDDDY. FNLILPMFNC LMKVL . . - AS 
YYSSLSGSXL GKAKLEANAY IFKXLLKSAS 
LLSDEVTRAR TDTVSAYATD SMAFLVKMLP 



CONSENSUS -V- 



yl — c-nl : 



BYV_p61 
CTV_p61 
LITV_P59 
GLRaV2_p55 



<TOT,™SPSD NPLVAGXLYD MCFEYOTLKS TYLXNXESFD CFLSLYLPLL 
NPLLTGMLIE FCLENKVYYS TFKVNLDNVR LFKSXVLPW 
^SSSr nplaxskfmn lytnhvtdse TFKSKFEALK SXKTPFASFI 
G^FDIDRLSH ™ LLVRRRPANF SYDVRVAWVY DVXATLKLVI 



CONSENSUS — 1- 



— npl 1" ^ 



BYV_p61 
CTV_p6L 
LIYV_PS9 
GL.RaV3_p55 

CONSENSUS 



SZVFSMNWE^ PAFDVRLLFE LDAAELLLKV PTINMHDST. ..FLYKNKLR 
SiS^S D^/DERVLI? FDPTDFVLDL PKUJIHDTM. . .VWGNQIR 
SafSS - ^ ..-LN FSDSKIFYAL PKEKQSDVLS DDMMVESXVR 
RLFFNKCTFG GI2CDLXPCVP IESFDPFHEL S 

* H £— d-« — 1 P d 



.SYFS 
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BYV_p61 YLESYFEDDS NELZKVKVDS LLTRDNPEL . , KLAQRWV GFHCYYG 

CTV_p61 QLEYWESDA LDDLSQHVDL RLAADNPDL. .RVGLRWA GMFVYYG 

LIYV_P59 DAASFTWSD NNYLPERVDR FVTQLLLELF PKTXASFPNK IMFGFLHYFA 

GLRaV3_p55 RLSYEMTTGX GGXICPEIAE KLVRRLMEEN YXLRLT.PVM ALXIILVYYS 



CONSENSUS -1 



— VG- 



—el- -k w g-l-Yy- 



II 

BYV_p61 VFRTAQTRKV KRDAEYXLPP AL GE FVTNMSGVEE FF . EELQKKM 

CTV_p61 VYRCWDRAV ERPTLFRLPQ KLLSQDDGES CSLHMGSVEA LF . NLVQKVN 

LIYV.P5 9 LSTTNSXR FNCTQ ESTIEIEGET LXISLKrITS YLRNAIQSQH 

GLRaV3_p55 IYGTNATRIK RRPDFLNVRI KGRVE KVSLRGVED . . RAFRISEK 

CONSENSUS vy-t R — -r f i ve- -£ q 



BYV_p6l PSI...SVRR RFCGSLSHEA FSVFKRFGVG FPFITRLNVP VKYSYLNVDY 

CTV_p61 KDI...NVRR QFMGRHSEVA LRLYRNLGLR FPPISSVRLP AHHGYLYVDF 

LIYV_?59 PDYADSNIVR LWCNKRSNLA LGY7XSRNIQ LYLYS..KYP RLLNYMRFDY 

GLRaV3_p56 RGINAQRVLC RYYSDLTCIiA RRHYGIRRNN WKTLSYVD. . GTLAYDTADC 



CONSENSUS 



-r -fc s — A 1 f s p Yl — Dy 



BYV_p61 YRHVKRVGLT QDELTILSNI EFDVAEMCCE REVALQARRA QR GEKP 

CTV_p61 YXRVPDGAVT ADEL£SI*RQL RSSVDVMCKD R.VSITPPPF NRLRRGSSRT 

I.IYV.P59 FKGLDMGKLT DEERLSIQTL RCITEDRS . E GTLATHNDLN SWILRP. 

GLRaV3_p55 ITSKVRNTIN TADHASIIHY IXTNENQVTG TTLPHQL* 



CONSENSUS y t -de — s e e 



BYV_p61 FQGWKGTKNE ISPHARSSIR VKXNNDSLLN ILWKDVGARS QRRLNPLHRK 

CTV_p61 FRGR.GARGA SSRHMSRDVA TSGFNLPYHG RLYSTS* 

LIYV_P59 

GLRaV3_p55 

CONSENSUS 



BYVjfil H" 

CTV_p61 . . 

LIYV.P59 . . 
GLRaV3_p55 

CONSENSUS — 
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TABLE 14 

Nucleotide and Deduced Amino Acid Sequences of the 
Partial HS?90-Related Gene of GLRaV-3 Adapted for 
Cloning into Plant Expression Vectors. 



(5 • primer, 93-224) 
Ncol 

tac ttatctaaaa cc ^» 

" ^ ^G AAGCGAGTCGACGACTA 

AT GGAAGCGAGTCGACG ACTAT CGC CATGG GACGC CGC CTTTTGCAGAGC AGTGT CGGTT 
KEASRRLSPSDAAFCRAVSV 
CAGGTAGGGAAGTATGTGGACG T AACG CAG AATTT AGAAAGTACG ATC GTGCC GTTAAGA 

QVGKYVDVTQNLESTIVPLR - 
GTTATGGAAATAAAGAAAAGACGAGGATCAGCAC^TGTTAGTTTACCGAAGGTGGTATCC 

VMEIKXRRGSAHVSLPKVVS 
GCTTACGTAG ATTTTTATACGAACrTGCAGGAATTGCTGT CGG ATGAAGT AACTAGGGCC 

AYVDFYTNLQELLSDEVT RA - 
AG AAC CG ATACAGTTTCGGCATACG CTACC GACTC TATGG CTTTCTTAGTTAAGATGTTA 

RTDTVSAYATDSMAFLVKML 
CCCCTGACTGCTCGTGAGCAGTGGTTAAAAGACGTGCTAGGATATCTGCTGGTACGGAGA 



PL-TAREQWLKDVLGYLLVRR 
OG ACC AGCAAATTTTTCCTACC AOGTAAGAGT AGCTTGGGTATATGACGT GATCG CTACG 



RPANFSYDVRVAWVYDVIAT 
^CAAGCTGGTCATAAGATTGTTTTTCAACAAGGACACACCCGGGGGTATTAA^ 



lklvirlffnkdtpggixdl - 
aaaccgtgtgtccctaxax^gtcattcgacccctt^^ 

kpcvpiesfdpfhelssyfs 
acgttaagttacgagatgacgacaggtaaagggggaaagatatgcccggagatcgccga^ 

rl5yemttgkggkicpeiae 
aagtrggtgcgccctcraatggaggaaaactataagtraagattgaccccactgatggcc 

klvrrlmeenyklrltpvma 
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TTAATAATTATACTCGTATACTACTCCATTTACGSCACAAACGCTACCACGATTAAAAGA 

LIIILVYYS IYGTNATRIKR - 

CCCCCGGATTTCCTCAATGTGAGGATAAAGGGAACAGTCGAGAAGCTTTCGTTACGGGGG 

RPOFLNVRIKGRVEKVSLRG - 

CTAGAAGATCGTGCCTTTACAATATCAGAAAAGCGCGGGATAAACCCTCAACGTGTATTA 

VEDRAFRISEKRGINAQRVL - 

TGTAGGTACTATAGCGATCTCACATGTCTGGCTAGGCGACATTACGX^ATTCGCAGGAAC 

CRYYSDLTCI*ARRHYG IRRN - 

AATTCGAAGACGCTCAGTTATGTAGACGGGACGTTAGCGTATGACACGGCTGATTGTATA 

NWKTLSYVDGTLAYDTADCI - 

ACTTCTAAGGTGAGAAATACGATCAACACCGCAGATCACGCTAGCATTATACACTATATC 

TSKVRNT INTADHAS I IHYI - 

AAGACGAACGAAAACCAGCTTACCCGAACTACTCTACCACACCAGCTTTAAAGCTGCGTG 

XTNENQVTGTTLPHQL * 

TAGTATGCGACGATGTTTCr 

„ < 10503 

ATCAT ACG CTC CTACAAAGA"" 




;Zaggagttct 
Ncol 

(3' primer, 93-225) 
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TABLS 15 

Nuc 1 °oc f de and Deduced Amino Acid Sequences of a SQ- 
amDlified Fragment of the GLRaV-3 Genome External and 
Internal Primes are underlined and their orientations 
are indicated by arrows. 



vDSJr'LP :<:<o?- dd^h^^s?.^^ 

- . , _,,,_. r ^« ~rs^CCST^AAGAGTTATGGAAATAAAGAAAAG 
CGTAACGCAG AATTTAGAA^GT.n.C ^ . C * 

12 i (23-25) < H •* *• 180 

VTQNLE S T IV?LRVM*IK*R 

acgaggatc^cacxtcttagt^akg^g^ 2 4Q 

181 ^c-Ig^aca^ca^^ 

rgsahv s l pkvvsayvdfyt 

GAACTTCCAGGAATTGCTC^CCGATCAACTAAC^^ 3Q0 

241 "11^171/-.-* ^-^^r^C^TTGATCCCGC^CrrCGCTATGTCAAAGCCG 

CTTGAACGTCCTT^AC-AC.G.^CTT^^ ^RTDTVSA — 



M L Q E L L S 



301 ^gI^aga^acc^^ . 

YATOSMAFLVKMLPLTAR-Q 

_ . . , „Ttr-CC-AGTACGCAGACGACCAGCAAAr:"rrTCCTA 

GTGCTTAAAAGACGTGw.«G««-ATCTGs.-»w-.«v- ^ „. ^ 420 

421 GC^^CGAACCCA^AC^^ 

D.VRVAWVYDVIATLKI.VIRL 
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CLAIMS : 

1. An isolated DNA molecule encoding a protein or 
polypeptide of a grapevine leaf roll virus. 

5 

2. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide is selected from a group 
consisting of a helicase, an RNA-dependent RNA polymerase, an 
hsp70-related, an hsp90-related, a coat protein or 

10 polypeptide, coat protein repeat, ORF8 (p21), ORF9 (P20) , 
ORF10 (P20) and ORF11 (p7) . 



3. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is a helicase having a 
15 molecular weight of from about 146 to about 151 kDa. 



20 



4. An isolated DNA molecule according to claim 3, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO:2. 

5. An isolated DNA molecule according- to claim 4, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO : 1 . 

25 6. An isolated DNA molecule according to claim 2, 

wherein the protein or polypeptide is an RNA-dependent RNA 
polymerase having a molecular weight of from about 59 to about 
63 kDa. 



30 7. An isolated DNA molecule according to claim 6, 

wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 4. 

8. An isolated DNA molecule according to claim 7, 
35 wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 3. 
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9. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is an hsp70-related protein 
or polypeptide having a molecular weight of from about 57 to 
about 61 kDa. 

10. An isolated DNA molecule according to claim 9, . 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 6. 

11. An isolated DNA molecule according to claim 10, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 5. 

12. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is an hsp90-related protein 
or polypeptide having a molecular weight of from about 53 to 
about 57 kDa. 

13. An isolated DNA molecule according to claim 12, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 8. 

14. An isolated DNA molecule according to claim 13, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 7. 

15. An isolated DNA molecule according to claim 2, 
wherein the protein or polypeptide is a coat protein or 
polypeptide having a molecular weight of from about 33 to 
about 43 kDa. 

16 . An isolated DNA molecule according to claim 15, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 10. 
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17. An isolated DNA molecule according to claim 16, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 9. 

18. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 12. 

19. An isolated DNA molecule according to claim 18, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 11. 

20. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 14. 

21. An isolated DNA molecule according to claim 20, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 13. 

22. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO: 16. 

23. An isolated DNA molecule according to claim 22, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 15. 

24. An isolated DNA molecule according to claim 1, 
wherein the protein or polypeptide comprises an amino acid 
sequence corresponding to SEQ ID NO; 18. 

25. An isolated DNA molecule according to claim 24, 
wherein the DNA molecule comprises a nucleotide sequence 
corresponding to SEQ ID NO: 17. 
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26. An expression system comprising a DNA segment 
corresponding to a DNA molecule according to any of claims 1 
to 25 in a vector heterologous to the DNA molecule and wherein 
a nucleotide sequence encoding a protein or polypeptide of a 
grapevine leaf roll virus is operatively linked to nucleotide 
sequences which direct the expression of said protein or 
polypeptide . 



10 27. A host cell transformed with a heterologous 

expression system according to claim 26. 

28. A host cell according to claim 27, wherein the host 
cell is selected from the group consisting of Agrobacterium 

15 vitis and Agrobacterium tumefaciens . 

29. A host cell according to claim 28, wherein the host 
cell is a grape cell or a citrus cell. 

20 30. A transgenic Vitis scion cultivar or rootstock 

cultivar comprising a DNA sequence encoding a protein or a 
polypeptide of a grapevine leaf roll virus according to any of 
claims 1-25. 

25 31. A transgenic Vitis scion cultivar or rootstock 

cultivar according to claim 30, wherein the Vitis scion or 
rootstock cultivar is a scion cultivar selected from the group 
consisting of Vitis vinifra, Alden, Almeria, Anab-E-Shahi. 
Autumn Black, Beauty Seedless, Black Corinth, Black Damascus, 

30 Black Malvoisie, Black Prince, Blackrose, Bronx Seedless, 

Burgrave, Calmeria, Campbell Early, Canner, Cardinal, Catawba, 
Christmas, Concord, Dattier, Delight, Diamond, Dizmar, 
Duchess, Early Muscat, Emerald Seedless, Emperor, Exotic, 
Ferdinand de Lesseps, Fiesta, Flame seedless, Flame Tokay, 

35 Gasconade, Gold, Himrod, Hunisa, Hussiene, Isabella, Italia, 
July Muscat, Khandahar, Katta, Kourgane, Kishmishi, Loose 
Perlette, Malaga, Monukka, Muscat of Alexandria, Muscat Flame, 
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Muscat Hamburg, New York Muscat, Niabell, Niagara, Olivette 
blanche, Ontario, Pierce, Queen, Red Malaga, Ribier, Rish 
Baba, Romulus, Ruby Seedless, Schuyler, Seneca. Suavis (IP 
365), Thompson seedless, Aleatico, Alicante Bouschet, Aligote, 
Alvarelhao, Aramon, Baco blanc (22A) , Burger, Cabernet franc. 
Cabernet, Sauvignon, Calzin, Carignane, Charbono, Chardonnay, 
Chasselas dore, Chenin blanc, Clairette blanche, Early 
Burgundy, Emerald Riesling, Feher Szagos, Fernao Pires, Flora, 
French Colombard, Fresia, Furmint, Gamay, Gewurztraminer, 
Grand noir, Gray Riesling, Green Hungarian, Green Veltliner, 
Grenache, Grille Helena, Inzolia, Lagrein, Lambrusco de 
Salamino, Malbec, Malvasia bianca, Mataro, Melon, Merlot, 
Meunier, Mission, Montua de Pilas, Muscadelle du Bordelais, 
Muscat blanc, Muscat Ottonel, Muscat Saint-Vallier , Nebbiolo, 
Nebbiolo fino, Nebbiolo Lampia, Orange Muscat, Palomino, Pedro 
Ximenes, Petit Bouschet, Petite Sirah, Peverella, Pinot noir. 
Pinot Saint-George, Primitivo di Gioa, Red Veltliner. Refosco, 
Rkatsiteli, Royalty, Rubired, Ruby Cabernet, Saint -Emi lion, 
Saint Macaire, Salvador, Sangiovese, Sauvignon blanc, 
Sauvignon gris, Sauvignon vert. Scarlet, Seibel 5279, Seibel 
9110, Seibel 13053. Semillon, Servant, Shiraz, Souzao, Sultana 
Crimson, Sylvaner, Tannat, Teroldico, Tinta Madeira, Tinto 
cao, Touriga, Traminer, Trebbiano Toscano, Trousseau, 
Valdepenas, Viognier, Walschriesling, White Riesling, and 
Zinf andel . 

32. A transgenic Vitis scion cultivar or rootstock 
cultivar according to claim 30. wherein the Vitis scion or 
rootstock cultivar is a rootstock cultivar selected from the 
group consisting of Couderc 1202, Couderc 1613, Couderc 1616, 
Couderc 3309. Dog Ridge, Foex 33 EM, Freedom, Ganzin 1 (A x R 
#1), Harmony, Kober 5BB, LN33, Millardet & de Grasset 41B, 
Millardet & de Grasset 420A, Millardet & de Grasset 101-14, 
Oppenheim 4 (S04) , Paulsen 775, Paulsen 1045, Paulsen 1103, 
Richter 99. Richter 110. Riparia Gloire, Ruggeri 225. Saint - 
George, Salt Creek, Teleki 5A, Vitis rupestris Constantia, 
Vitis California, and Vitis girdiana. 
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33. A method of imparting grapevine leaf roll virus 
resistance to a Vitis scion cultivar or rootstock cultivar 
comprising: 

5 transforming a Vitis scion cultivar or rootstock cultivar 

with a an expression system according to claim 26 wherein said 
vector is a plant transformation vector. 

34. A method according to claim 33, wherein the Vitis 
10 scion cultivar or rootstock cultivar is a scion cultivar 

selected from the group consisting of Vitis vinifra, Alden, 
Almeria, Anab-E-Shahi, Autumn Black, Beauty Seedless, Black 
Corinth, Black Damascus, Black Malvoisie, Black Prince, 
Blackrose, Bronx Seedless, Burgrave, Calmeria, Campbell Early, 

15 Canner, Cardinal, Catawba, Christmas, Concord, Dattier, 
Delight, Diamond, Dizmar, Duchess, Early Muscat, Emerald 
Seedless, Emperor, Exotic, Ferdinand de Lesseps, Fiesta, Flame 
seedless, Flame Tokay, Gasconade, Gold, Himrod, Hunisa, 
Hussiene, Isabella, Italia, July Muscat, Khandahar, Katta, 

20 Kourgane, Kishmishi, Loose Perlette, Malaga, Monukka, Muscat 
of Alexandria, Muscat Flame, Muscat Hamburg, New York Muscat, 
Niabell, Niagara, Olivette blanche, Ontario, Pierce, Queen, 
Red Malaga, Ribier, Rish Baba, Romulus, Ruby Seedless, 
Schuyler, Seneca, Suavis (IP 365), Thompson seedless, 

25 Aleatico, Alicante Bouschet, Aligote, Alvarelhao, Aramon, Baco 
blanc (22A) , Burger, Cabernet franc, Cabernet, Sauvignon, 
Calzin, Carignane, Charbono, Chardonnay, Chasselas dore. 
Chenin blanc, Clairette blanche, Early Burgundy, Emerald 
Riesling, Feher Szagos, Fernao Pires, Flora, French Colombard, 

30 Fresia, Furmint, Gamay, Gewurztraminer, Grand noir, Gray 

Riesling, Green Hungarian, Green Veltliner, Grenache, Grille 
Helena, Inzolia, Lagrein, Lambrusco de Salamino, Malbec, 
Malvasia bianca, Mataro, Melon, Merlot, Meunier, Mission, 
Montua de Pilas, Muscadelle du Bordelais, Muscat blanc, Muscat 

35 Ottonel, Muscat Saint -Vallier, Nebbiolo, Nebbiolo fino, 

Nebbiolo Lampia, Orange Muscat, Palomino, Pedro Ximenes, Petit 
Bouschet, Petite Sirah, Peverella, Pinot noir, Pinot Saint- 
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George, Primitive di Gioa, Red Veltliner, Refosco, Rkatsiteli, 
Royalty, Rubired, Ruby Cabernet, Saint-Emilion, Saint Macaire, 
Salvador, Sangiovese, Sauvignon blanc, Sauvignon gris, 
Sauvignon vert, Scarlet, Seibel 5279, Seibel 9110, Seibel 
13053, Semillon, Servant, Shiraz, Souzao, Sultana Crimson, 
Sylvaner, Tannat, Teroldico, Tinta Madeira, Tinto cao, 
Touriga, Traminer, Trebbiano Toscano, Trousseau, Valdepenas, 
Viognier, Walschriesling, White Riesling, and Zinfandel. 

35. A method according to claim 33, wherein the Vitis 
scion cultivar or rootstock cultivar is a rootstock cultivar 
selected from the group consisting of Couderc 1202, Couderc 
1613, Couderc 1616, Couderc 3309, Dog Ridge, Foex 33 EM, 
Freedom, Ganzin 1 (A x R #1) , Harmony, Kober 5BB, LN33, 
Millardet & de Grasset 41B, Millardet & de Grasset 420A, 
Millardet & de Grasset 101-14, Oppenheim 4 (S04) , Paulsen 775, 
Paulsen 1045, Paulsen 1103, Richter 99, Richter 110, Riparia 
Gloire, Ruggeri 225, Saint-George, Salt Creek, Teleki 5A, 
Vitis rupestris Constantia, Vitis California, and Vitis 
girdiana . 

36. A method according to claim 33. wherein the 
grapevine leafroll virus is selected from the group consisting 
of GLRaV-1, GLRaV-2, GLRaV-3 , GLRaV-4 , GLRaV-5, and GLRaV-6. 

37. A method according to claim 33, wherein said 
transforming is Agrobacteri urn-mediated. 

38. A method according to claim 37, wherein said 
transforming comprises: 

contacting tissue of the Vitis scion cultivar or 
rootstock cultivar with an inoculum of bacterium of the genus 
AgroJbacterium transformed with an expression construct 
according to claims 26 and regenerating a transformed plant. 
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39. A method according to claim 38, wherein the tissue 
is selected from the group consisting of leaf tissue, root 
tissue, meristems, zygotic and somatic embryos, and anthers. 

5 40. A method according to claim 33, wherein said 

transforming comprises: 

propelling particles at grape plant cells under 
conditions effective for the particles to penetrate into the 
cell interior and 
10 introducing an expression vector comprising the DNA 

molecule into the cell interior. 

41. A method according to claim 40, wherein the vector 
is associated with the particles, whereby the vector is 

15 carried into the cell interior together with the particles. 

42. A method according to claim 41, wherein the vector 
surrounds the cell and is drawn into the cell by the 
particle's wake. 
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43 . A transgenic citrus scion cultivar or rootstock 
cultivar transformed with an expression system according to 
claim 26, wherein said vector is a plant transformation 
vector. 



44. A transgenic citrus scion cultivar or rootstock 
cultivar according to claim 43, wherein the citrus scion 
cultivar or rootstock cultivar is selected from the group 
consisting of lemon, lime, orange, grapefruit, pineapple, and 

30 tangerine. 

45. A transgenic citrus scion cultivar or rootstock 
cultivar according to claim 43, wherein the citrus scion 
cultivar or rootstock cultivar is selected from the group 

35 consisting of Joppa, Maltaise Ovale, Parson (Parson Brown), 

Pera, Pineapple, Queen, Shamouti, Valencia, Tenerife, Imperial 
Doblefina, Washington Sanguine, Moro, Sanguinello Moscato, 
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Spanish Sanguinelli, Tarocco, Atwood, Australian, Bahia, 
Baiana, Cram, Dalmau, Eddy, Fisher, Frost Washington, 
Gillette, LengNavelina, Washington, Satsuma Mandarin, Dancy, 
Robinson, Ponkan, Duncan, Marsh, Pink Marsh, Ruby Red, Red 
Seedless, Smooth Seville, Orlando Tangelo, Eureka, Lisbon, 
Meyer Lemon', Rough Lemon, Sour Orange, Persian Lime, West 
Indian Lime, Bearss, Sweet Lime, Troyer Citrange, and Citrus 
trifoliata . 

46. A method for producing a virus -resistant citrus 
scion cultivar or rootstock cultivar comprising: 

transforming citrus tissue with an expression vector 
according to claim 26 . 

47. A method according to claim 46, wherein said 
transforming is Agrobacteri urn-mediated. 

48. A method according to claim 46, wherein the tissue 
is selected from the group consisting of leaf tissue, root 
tissue, meristems, zygotic and somatic embryos, and anthers. 

49. The method of claim 46 wherein said expression 
vector is introduced into plant cells by a biolistic method. 

50. An antibody or binding portion thereof or probe 
recognizing the protein or polypeptide encoded by the DNA 
molecule according to any of claims 1 to 25. 

51. An isolated protein or polypeptide corresponding to 
a protein or polypeptide of a grapevine leaf roll virus encoded 
by a DNA molecule according to any of claims 1-25. 

52. A method for detection of grapevine leaf roll virus 
in a sample, said method comprising: 

providing an antibody or binding portion thereof 
recognizing the encoded protein or polypeptide according to 
any of claims 1 to 25; 
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contacting the sample with the antibody or binding 

portion thereof; and 

detecting any reaction which indicates that grapevine 
leaf roll virus is present in the sample using an assay system. 



53. A method according to claim 55 # wherein the assay 
system is selected from the group consisting of an enzyme- 
linked immunosorbent assay, a radioimmunoassay, a gel 

10 diffusion precipitin reaction assay, an immunodiffusion assay, 
an agglutination assay, a fluorescent immunoassay, a protein A 
immunoassay, and an Immunoelectrophoresis assay. 

54. A method for detection of grapevine leaf roll virus 
in a sample, said method comprising: 

15 providing a nucleotide sequence of the DNA molecule 

according to any of claims 1 to 25 as a probe in a nucleic 
acid hybridization assay; 

contacting the sample with the probe; and 
detecting any reaction which indicates that grapevine 
20 leaf roll virus is present in the sample. 

55. A method for detection of grapevine leafroll virus 
in a sample: 

providing a nucleotide sequence of the DNA molecule 
25 according to any of claims 1 to 25 as a probe in a gene 

amplification detection procedure; 

contacting the sample with the probe; and 

detecting any reaction which indicates that grapevine 

leafroll virus is present in the sample. 



30 



AN 1996:479806 CAPLUS 
DN 125:138016 

TI Coat protein gene identification, genome organization, and per 

detection of grapevine leafroll associated closterovirus-3 and study 
towards transgenic grapevines (vitis) 

AU Ling, Kai-Shu 

CS Cornell Univ., Ithaca, NY, USA nnQ „. QCft 
SO (1995) 232 pp. Avail.: Univ. Microfilms Int., Order No. DA9624860 

From: Diss. Abstr. Int., B 1996, 57(3), 1539 
DT Dissertation 
LA English 

CC 11-5 (Plant Biochemistry) 
AB Unavailable 

ST closterovirus grapevine leafroll gene coat protein; genome 

organization grapevine leafroll virus 
IT Genome 

Grape . • , 

(coat protein gene identification, genome organization, and per 
detection of grapevine leafroll assocd. closterovirus-3 and study 
towards transgenic grapevines) 
IT Gene, plant # . _ 

RL: BOC (Biological occurrence); PRP (Properties); BIOL (Biological 

study) ; OCCU (Occurrence) 

(coat protein gene identification, genome organization, and per 
detection of grapevine leafroll assocd. closterovirus-3 and study 
towards transgenic grapevines) 
IT Virus, plant 

(clostero-, coat protein gene identification, genome 
organization, and per detection of grapevine leafroll assocd. 
closterovirus-3 and study towards transgenic grapevines) 
IT Proteins, specific or class 

RL: BOC (Biological occurrence); PRP (Properties); BIOL (Biological 
study); OCCU (Occurrence) 

(coat, coat protein gene identification, genome organization, and 
per detection of grapevine leafroll assocd. closterovirus-3 and 
study towards transgenic grapevines) 

IT Virus, plant 

(leafroll, coat protein gene identification, genome organization, 
and per detection of grapevine leafroll assocd. closterovirus-3 
and study towards transgenic grapevines) 



